Patent application title: Methods and compositions for detecting colon cancers
Inventors:
Sanford D. Markowitz (Pepper Pike, OH, US)
Sanford D. Markowitz (Pepper Pike, OH, US)
Wei-Dong Chen (South Euclid, OH, US)
IPC8 Class: AC12Q168FI
USPC Class:
435 6
Class name: Chemistry: molecular biology and microbiology measuring or testing process involving enzymes or micro-organisms; composition or test strip therefore; processes of forming such composition or test strip involving nucleic acid
Publication date: 2009-03-05
Patent application number: 20090061453
Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
Patent application title: Methods and compositions for detecting colon cancers
Inventors:
Sanford D. Markowitz
Wei-Dong Chen
Agents:
ROPES & GRAY LLP
Assignees:
Origin: BOSTON, MA US
IPC8 Class: AC12Q168FI
USPC Class:
435 6
Abstract:
This application describes methods and compositions for detecting and
treating HLTF-associated neoplasia. Differential methylation of the HLTF
nucleotide sequences has been observed in HLTF-associated neoplasia such
as colon neoplasia.Claims:
1-34. (canceled)
35. A bisulfite-converted methylated HLTF nucleotide sequence selected from the group consisting of:a nucleotide sequence of any one of SEQ ID NOs: 5-8 or a fragment thereof;a complement of any one of SEQ ID Nos: 5-8 or a fragment thereof; anda nucleotide sequence that hybridizes under stringent conditions to a nucleotide sequence of any one of SEQ ID NOs: 5-8.
36. A bisulfite-converted methylated HLTF nucleotide sequence selected from the group consisting of:a nucleotide sequence of any one of SEQ ID NOs: 22-25 or a fragment thereof;a complement of any one of SEQ ID Nos: 22-25 or a fragment thereof; anda nucleotide sequence that hybridizes under stringent conditions to a nucleotide sequence of any one of SEQ ID NOs: 22-25.
37. The oligonucleotide primer of claim 47, wherein said primer has a sequence selected from the group consisting of SEQ ID NOs: 11-20, 26-38, and 40-43.
38. A kit for detecting an HLTF-associated neoplasia in a subject, comprising at least two primers of claim 37.
39. The kit of claim 38, further comprising a compound to convert a template DNA.
40. The kit of claim 39, wherein the compound is bisulfite.
41. The kit of claim 40, wherein each primer comprises at least a CpG dinucleotide.
42. A method of converting a nucleic acid sequence at least 95% identical to and one of SEQ ID NOs: 4 and 21 to a bisulfite converted sequence comprising:a) providing a nucleotide acid having a nucleotide sequence as set forth in any one of SEQ ID NOs: 4 and 21; andb) adding a bisulfite compound;whereby the unmethylated cytosine bases of the CpG islands are converted to a different base.
43. The method of claim 42, wherein the unmethylated cytosine is converted to a uracil.
44. A nucleic acid sequence as prepared by the method of claim 42.
45. An isolated or recombinant methylated HLTF nucleic acid, comprising a nucleotide sequence of any one of SEQ ID NOs: 2-4 and 21, wherein the cytosine of the CpG island is methylated.
46. An isolated or recombinant HLTF nucleic acid, selected from the group consisting of:a nucleotide sequence of any one of SEQ ID NOs: 2-4 and 21 or a fragment thereof;a complement of any one of SEQ ID Nos: 2-4 and 21;a nucleotide sequence that hybridizes under stringent conditions to a nucleotide sequence of any one of SEQ ID NOs: 2-4 and 21;a nucleotide sequence that is at least 98% identical to the nucleotide sequence of any one of SEQ ID NOs: 2-4 and 21;an nucleotide sequence comprising at least 50 consecutive base pairs of any one SEQ ID Nos; 2-4 and 21;wherein the HLTF nucleotide sequence is differentially methylated in an HLTF-associated disease cell.
47. An oligonucleotide primer for assessing methylation status in the human HLTF gene, wherein said primer specifically hybridizes under high stringent conditions to a bisulfite-converted template sequence selected from the group consisting of SEQ ID NOs: 5-8, 22-25, and a complement of any one SEQ ID NOs: 22-25.
48. The primer of claim 47, wherein the primer is a methylation-specific PCR primer having a sequence selected from the group consisting of SEQ ID NOs: 11, 12, 15, 17, 18, 26, 27, 30, 32, 33, 36, 37, 40, and 41.
49. The primer of claim 48, wherein said primer is a forward primer selected from the group consisting of SEQ ID NOs: 11, 17, 26, 32, 36, and 40.
50. The primer of claim 49, wherein said primer is a reverse primer selected from the group consisting of SEQ ID NOs: 12, 15, 18, 27, 30, 33, 37, and 41.
51. The primer of claim 47, wherein the primer is an unmethylation-specific PCR primer having a sequence selected from the group consisting of SEQ ID NOs: 13, 14, 16, 19, 20, 28, 29, 31, 34, 35, 37, 38, 42, and 43.
52. The primer of claim 51, wherein the primer is a forward primer selected from the group consisting of SEQ ID NOs: 13, 19, 28, 34, 38, and 42.
53. The primer of claim 51, wherein the primer is a reverse primer selected from the group consisting of SEQ ID NOs: 14, 16, 20, 29, 31, 35, 37, and 43.
54. The primer of claim 47, wherein said primer is further labeled with a detectable marker.
55. The primer of claim 54, wherein said primer is labeled with a fluorescent dye.
56. The kit of claim 38, wherein said kit comprises a forward primer selected from the group consisting of SEQ ID NOs: 11, 17, 26, 32, 36, and 40, and a reverse primer selected from the group consisting of SEQ ID NOs: 12, 15, 18, 27, 30, 33, 37, and 41.
57. The kit of claim 56, wherein said kit further comprises a forward primer selected from the group consisting of SEQ ID NOs: 13, 19, 28, 34, 38, and 42, and a reverse primer selected from the group consisting of SEQ ID NOs: 14, 16, 20, 29, 31, 35, 37, and 43.
58. An oligonucleotide primer for assessing methylation of the human HLTF gene, wherein the primer has a sequence selected from the group consisting of SEQ ID NOs: 9-10.
59. A kit for detecting HLTF-associated neoplasia in a subject, comprising at least two primers of claim 54.
Description:
CROSS-REFERENCE TO RELATED APPLICATIONS
[0001]This application is a divisional of U.S. Provisional application Ser. No. 10/266,103, filed Oct. 7, 2002, which claims the benefit of priority of U.S. Provisional Application No. 60/327,537 filed on Oct. 5, 2001. The entire teachings of the referenced Applications are incorporated herein by reference in its entirety.
BACKGROUND
[0002]In 2001, over 1.2 million new cases of human cancer will be diagnosed and over 0.5 million people will die from cancer (American Cancer Society estimate). Despite this, more people than ever are living with and surviving cancer. In 1997, for example, approximately 8.9 million living Americans had a history of cancer (National Cancer Institute estimate). People are more likely to survive cancer if the disease is diagnosed at an early stage of development, since treatment at that time is more likely to be successful. Early detection depends upon availability of high-quality methods. Such methods are also useful for determining patient prognosis, selecting therapy, monitoring response to therapy and selecting patients for additional therapy. Consequently, there is a need for cancer diagnostic methods that are specific, accurate, minimally invasive, technically simple and inexpensive.
[0003]Colorectal cancer (i.e., cancer of the colon or rectum) is one particularly important type of human cancer. Colorectal cancer is the second most common cause of cancer mortality in adult Americans (Landis, et al., 1999, CA Cancer J Clin, 49:8-31). Approximately 40% of individuals with colorectal cancer die. In 2001, it is estimated that there will be 135,400 new cases of colorectal cancer (98,200 cases of colon and 37,200 cases of rectal cancer) and 56,700 deaths (48,000 colon cancer and 8,800 rectal cancer deaths) from the disease (American Cancer Society). As with other cancers, these rates can be decreased by improved methods for diagnosis.
[0004]Although methods for detecting colon cancer exist, the methods are not ideal. Digital rectal exams (i.e., manual probing of rectum by a physician), for example, although relatively inexpensive, are unpleasant and can be inaccurate. Fecal occult blood testing (i.e., detection of blood in stool) is nonspecific because blood in the stool has multiple causes. Colonoscopy and sigmoidoscopy (i.e., direct examination of the colon with a flexible viewing instrument) are both uncomfortable for the patient and expensive. Double-contrast barium enema (i.e., taking X-rays of barium-filled colon) is also an expensive procedure, usually performed by a radiologist.
[0005]Because of the disadvantages of existing methods for detecting colon cancer, new methods are needed.
SUMMARY OF THE INVENTION
[0006]In accordance with the present invention, new methods are provided for detection of colon cancer. In one aspect, the method comprises assaying for the presence of differentially methylated HLTF nucleotide sequences in a tissue sample or a bodily fluid sample from a subject. Preferred bodily fluids include blood, serum, plasma, a blood-derived fraction, stool, colonic effluent or urine. In one embodiment, the method involves restriction enzyme/methylation-sensitive PCR. In another embodiment, the method comprises reacting DNA from the sample with a chemical compound that converts non-methylated cytosine bases (also called "conversion-sensitive" cytosines), but not methylated cytosine bases, to a different nucleotide base. In a preferred embodiment, the chemical compound is sodium bisulfite, which converts unmethylated cytosine bases to uracil. The compound-converted DNA is then amplified using a methylation-sensitive polymerase chain reaction (MSP) employing primers that amplify the compound-converted DNA template if cytosine bases within CpG dinucleotides of the DNA from the sample are methylated. Production of a PCR product indicates that the subject has colon cancer. Other methods for assaying for the presence of methylated DNA are known in the art.
[0007]In another aspect, the method comprises assaying for decreased levels of an HLTF transcript in the sample. Examples of such assays include RT-PCR assays which employ primers that derived from the coding sequence of HLTF.
[0008]In another aspect, the present invention provides a detection method for prognosis of a colon cancer in a subject known to have or suspected of having colon cancer. Such method comprises assaying for the presence of differentially methylated HLTF nucleotide sequences in a tissue sample or bodily fluid from the subject. In certain cases, presence of differentially methylated HLTF nucleotide sequences in the tissue sample bodily fluid indicates that the subject is a good candidate for a particular therapy. In other cases, presence of the differentially methylated HLTF nucleotide sequences in the tissue sample or bodily fluid indicates that the colon cancer has a poor prognosis or the subject is a candidate for more aggressive therapy.
[0009]In another aspect, the present invention provides a method for monitoring over time the status of colon cancer in a subject. The method comprises assaying for the presence of differentially methylated HLTF nucleotide sequences in a tissue sample or bodily fluid taken from the subject at a first time and in a corresponding bodily fluid taken from the subject at a second time. Absence of differentially methylated HLTF nucleotide sequences from the bodily fluid taken at the first time and presence of differentially methylated HLTF nucleotide sequences in the bodily fluid taken at the second time indicates that the cancer is progressing. Presence of differentially methylated HLTF nucleotide sequences in the tissue sample or bodily fluid taken at the first time and absence of differentially methylated HLTF nucleotide sequences from the tissue sample or bodily fluid taken at the second time indicates that the cancer is regressing.
[0010]In another aspect the present invention provides a method for evaluating therapy in a subject suspected of having or having colon cancer. The method comprises assaying for the presence of methylated HLTF promoter DNA in a tissue sample or bodily fluid taken from the subject prior to therapy and a corresponding bodily fluid taken from the subject during or following therapy. Loss of methylated HLTF promoter DNA or a decrease in methylation of HLTF promoter DNA in the sample taken after or during therapy as compared to the levels or HLTF promoter DNA in the sample taken before therapy is indicative of a positive effect of the therapy on cancer regression in the treated subject
[0011]The present invention also provides nucleotide primer sequences for use in the methylation-sensitive PCR assay.
[0012]The present invention also provides a method of inhibiting or reducing growth of colon cancer cells. The method comprises increasing the levels of the protein encoded by HLTF in colon cancer cells. In one embodiment, the cells are contacted with the HLTF protein or a biologically active equivalent or fragment thereof under conditions permitting uptake of the protein or fragment. In another embodiment, the cells are contacted with a nucleic acid encoding the HLTF protein and (ii) a promoter active in the cancer cell, wherein the promoter is operably linked to the region encoding the HLTF protein, under conditions permitting the uptake of the nucleic acid by the cancer cell. In another embodiment, the method comprises demethylating the methylated HLTF promoter DNA.
[0013]In one aspect, the application provides isolated or recombinant HLTF nucleotide sequences that are at least 80%, 85%, 90%, 95%, 98%, 99% or identical to the nucleotide sequence of any one of SEQ ID NOs: 2-4 and 21, fragments of said sequences that are 10, 15, 20, 25, 50, 100, or 150 base pairs in length wherein the HLTF nucleotide sequences are differentially methylated in an HLTF-associated disease cell.
BRIEF DESCRIPTION OF THE DRAWINGS
[0014]FIG. 1 illustrates HLTF silencing in colon cancer cell lines. A. HLTF RNA expression. Shown is an RT-PCR assay for HLTF expression in colon cancer cell lines. B. HLTF expression reactivation. Shown is an RT-PCR assay for HLTF expression in colon cancer cell lines treated (+) or untreated (-) with 5-azacytdine (5-azaC). Cell lines V429 and V503 are controls with constitutive HLTF expression. 5-azaC treatment reactivates HLTF expression in cell lines RCA, V457, SW480, V5, V6, and V432.
[0015]FIG. 2 shows methylation of HLTF 5' genomic sequence. A. Diagram of the HLTF 5' genomic region. CpG sites are shown with circles. Shaded circles represent CpG sites that are tested in MS-PCR assays. Hatched circles represent CpG sites that overlap HpaII restriction sites. B. MS-PCR assay of the HLTF 5' genomic sequence. Shown are the results of MS-PCR assay of the HLTF 5' genomic sequence by using primers specific for amplification of either methylated (M) or unmethylated (U) templates. C. HLTF MS-PCR of matched cell lines and tissues. Shown are the results of HLTF MS-PCR assay of colon cancer cell lines (C), matched antecedent tumor tissue (T) or matched normal colon mucosa (N).
[0016]FIG. 3 shows methylation status of the HLTF 5' genomic region in primary tumors and matched normal tissues. Shown are the results of MS-PCR assay of the HLTF 5' genomic region in matched paired tumor (T) and normal (N) colon tissues samples amplified with primers specific for methylated (M) or unmethylated (U) templates.
[0017]FIG. 4 shows correlation of HLTF 5' genomic region methylation with age. A. In colon cancer tumors and cell lines, B. In normal colon tissues.
[0018]FIG. 5 shows correlation of HLTF 5' genomic region methylation with tumor site (A, B) or with tumor stage (C, D). Shown in A and B are percentage (%) of colon neoplasms (tumors and cell lines) in each category defined by location of the tumor in the colon and HLTF methylation status. Shown in C and D are percentage (%) of colon neoplasms (tumors and cell lines) in each category defined by clinical stage of the colon tumor and HLTF methylation status.
[0019]FIG. 6 illustrates correlation of HLTF methylation with the CpG island methylator phenotype (i.e., CIMP status) and with hMLH1 methylation. A. Shown are the numbers of primary colon cancers in each of the categories defined by combined HLTF methylation and CIMP status. B. Shown are the numbers of colon cancers (tumors and cell lines) in each category defined by combined hMLH1 and HLTF methylation status.
[0020]FIG. 7 shows HLTF colony suppressor activity. Shown are the number of G418 resistant colonies arising from transfection with an HLTF expression vector (HLTF) or a control empty expression vector (pcDNA) in HLTF unmethylated and expressing FET, V364 and V429 cells (B) as compared to HLTF methylated and deficient V457, V8-2, and RCA cells (A). C. Anti-V5 western blot assay of V5-epitope tagged HLTF introduced by transient transfection into HLTF methylated versus unmethylated cells. Control cells were transfected with an empty expression vector (pcDNA3.1).
[0021]FIG. 8 shows a diagram of the HLTF 5' genomic region. CpG sites are shown with circles and stems. Hatched circles represent CpG sites that overlap HpaII restriction sites. Sequences that are complementary to PCR primers that were used to selectively amplify the methylated but not unmethylated HLTF 5' genomic sequence after digestion with HpaII are designated by the location of the arrows corresponding to forward PCR primer 1277F and reverse PCR primer 1724R. Shaded circles represent the CpG sites that are tested by MS-PCR assay primers described as examples in this application. Locations of specific primers used in the specific MS-PCR assays described as examples in this application are indicated with arrows and correspond to forward PCR primer 1352MF, designed as a forward primer for amplification of bisulfite converted sense sequences of duplex DNA derived from the methylated parental sense strand, and reverse primers 1606MR and 1627MR, designed as reverse primers for amplification of bisulfite converted sequences of duplex DNA derived from methylated parental sense strand. Primers 1352MF(ASS) and 1607MR(ASS) indicate the forward and reverse primers for amplification of duplex DNA derived from bisulfite converted sequences of methylated parental antisense strands. Further, the control primers (indicated as UF or UR) that are used to detect the unmethylated HLTF template in an MSP are also indicated in the diagram.
[0022]FIG. 9 shows the structure of the 5' region of HLTF at 3 levels of resolution. The top panel (A) depicts residues 0-4500 spanning exons 1 and exons 2, and depicting the position of three Alu repeats in which Alu1 is upstream of exon 1, Alu 2 is in intron 1, and Alu 3 is in intron 3. Balloons designate the positions of CpG dinucleotide sequences. The second panel (B) depicts at higher magnification the structure between residues 0-3000, spanning Alu1, exon1, and Alu2. The third panel (C) shows at higher magnification the region from residues 550 to 2459 that is between Alu1 and Alu2.
[0023]FIG. 10 summarizes the results of the sequencing across HLTF genomic residues 0-3000 of bisulfite converted genomic DNA from 6 different samples: Vaco5 (an HLTF silenced colon cancer cell line), Vaco206 (an HLTF expressing colon cancer cell line), and 4 normal colon epithelial tissue samples (19-11N, 587N, 421N, and 406N). Multiple individual DNA clones were sequenced for each of the bisulfite converted samples. The positions of Alu1, HLTF exon1, and Alu2 are shown in boxes. Open balloons denote CpG residues where cytosine methylation was found to be constitutive in normal colonic tissue. In both Alu1 and Alu2, all CpG residues are constitutively methylated in normal tissue. Additionally, CpG residues that are 3' of Alu1 from bases 550-1200 were also all found to be constitutively methylated. A differentially methylated region, that is methylated in HLTF silenced Vaco5 and is, in general, not methylated in normal colon or in HLTF expressing cancer is defined by the CpG dinucleotides lying between residues 1200 and 2600. The differential methylation of 5 HpaII sites between residues 1277 and 1742 was independently confirmed in multiple additional normal and cancer samples by assays of the resistance of these sites to HpaII digestion. Additionally, 3 sets of MS-PCR primers were designed to assay the methylation status of residues between 1352 and 1672 (as shown on the figure), and these MS-PCR assays also confirmed that these residues were unmethylated in normal colon tissue and in HLTF expressing colon cancers, but were methylated in HLTF silenced colon cancers.
[0024]FIG. 11 shows a final diagramatic summary of the structure of the region between Alu1 and Alu2 repeats, and designates the boundaries of the base pair 1200-2500 region outside of which there is constitutive methylation of the adjacent CpG dinucleotides.
[0025]FIG. 12 summarizes for HLTF gene residues 550 to 2500 the primary results of the bisulfite sequencing in the 6 different samples (corresponding to the region flanked by the Alu1 and Alu2 repeats). The balloons indicate the position of the CpG dinucleotides. Data from the 6 samples is summarized by the 6 lines. At each CpG residue, an open circle indicates the residue was unmethylated in that sample, a black filled circle indicates the residue was methylated in every bisulfite converted clone derived from that sample, and a grey filled circle indicates that the residue was methylated in from 20%-70% of the clones derived from that sample. In the HLTF silenced Vaco5 sample, there basically is methylation of every residue from base pair 550 to 2500. However, the residues from base pairs 550 to 1200 are also seen to be methylated in the normal colon samples as well as in the HLTF expressing cancer cell line Vaco206. The residues between 1200 and 2500 essentially define a region that is differentially methylated in the HLTF silenced Vaco5 sample. Note is made of slight methylation of residues 1200 to 1400 in one normal sample, and of residues 2300 to 2500 in Vaco206. However, this slight methylation is distinguished from the core methylation of HLTF silenced cancers by all of the MS-PCR assays described in this application as well as by the described assay of methylation of the HpaII sites that span the 1277 to 1742 interval.
[0026]FIG. 13 shows the results of bisulfite sequencing of multiple individually derived DNA clones derived from bisulfite treated DNA from the Vaco5 colon cancer cell lines which do not express HLTF. In each of these Vaco5 derived DNA clones, essentially all of the CpG sites are methylated in the region from residues 550 to 2500.
[0027]FIG. 14 shows the amino acid sequence (SEQ ID NO: 1) of human HLTF protein.
[0028]FIG. 15 shows the 5' genomic sequence of human HLTF gene (residues 1 to 3000, sense strand, SEQ ID NO: 2). The underlined region (residues 1250-1800, SEQ ID NO: 4) was tested by methylation specific PCR and by sensitivity to HpaII digestion. Alu1 and Alu2 regions are in bold. The start ATG is underlined and in bold, with the A at position 1757. There is currently one complete GeneBank entry "AC021059: Homo sapiens 3 BAC RP11-464E15 (Roswell Park Cancer Institute Human BAC Library) complete sequence", which contains the human HLTF gene. Residues 1-3000 correspond to positions 119396-116395 of this genomic clone (AC021059), and the A of the ATG relative to AC020159 will be 117640.
[0029]FIG. 16 shows the sequence of residue 600-2600 (SEQ ID NO: 3) that includes the differentially methylated region of residues 1200-2600, and a portion of the non-Alu constitutively methylated region within residues 600-1200.
[0030]FIG. 17 shows the HLTF nucleotide sequence base pairs 1250-1800 (SEQ ID NO: 4). CpG dinucleotides that are sites of cytosine methylation are in bold. Those CpG sites that are HpaII restriction sites are shown in italic and underlined. Sequences that are complementary to PCR primers that were used to selectively amplify the methylated but not unmethylated HLTF DNA after digestion with HpaII are shown as bold arrows. Those CpG site that are tested by specific MS-PCR assays described as examples in this invention are underlined. Parent sequences that were used to design specific MS-PCR primers that amplified methylated but not unmethylated templates following conversion with bisulfite are shown in smaller arrows.
[0031]FIG. 18 shows the sequences following bisulfite conversion of DNA derived from the sense strand of methylated template (top panel) and unmethylated template (bottom panel) of the HLTF 5' genomic sequence residues 1250-1800. CpG dinucleotides that are sites of cytosine methylation are in bold. Those CpG sites that are HpaII restriction sites are shown in italic and underlined. Sequences that are complementary to PCR primers that were used to selectively amplify the methylated but not unmethylated HLTF DNA after digestion with HpaII are shown as bold arrows. Those CpG site that are tested by specific MS-PCR assays described as examples in this invention are shown underlined. Sequences that were used to design specific MS-PCR primers that amplified methylated but not unmethylated templates following conversion with bisulfite are shown in smaller arrows.
[0032]FIG. 19 shows the corresponding complementary strands of the bisulfite-converted HLTF DNA base pairs 1250-1800 (methylated and unmethylated templates, as shown in FIG. 18). CpG dinucleotides that are sites of cytosine methylation are in bold. Those CpG sites that are HpaII restriction sites are shown in italic and underlined. Sequences that are complementary to PCR primers that were used to selectively amplify the methylated but not unmethylated HLTF DNA after digestion with HpaII are shown as bold arrows. Those CpG site that are tested by specific MS-PCR assays described as examples in this invention are shown underlined. Sequences that were used to design specific MS-PCR primers that amplified methylated but not unmethylated templates following conversion with bisulfite are shown in smaller arrows.
[0033]FIG. 20 shows primer sequences for amplifying HLTF. A. Forward PCR primer 1277F (SEQ ID NO: 9) and reverse PCR primer 1724R (SEQ ID NO: 10) selectively amplify the methylated but not unmethylated HLTF sequence after digestion with HpaII. Unmethylated DNAs are cut by HpaII and so cannot be PCR amplified B and C show primer sets for amplifying bisulfite-converted sense sequences of the duplex methylated HLTF DNA: forward PCR primer 1352MF (SEQ ID NO: 11) and reverse primer 1606MR (SEQ ID NO: 12); forward PCR primer 1352MF (SEQ ID NO: 11) and reverse primer 1627MR (SEQ ID NO: 15). D shows primer sets for amplifying bisulfite-converted antisense sequences of the duplex methylated HLTF DNA: forward primer 1352MF(ASS) (SEQ ID NO: 17) and reverse primer 1607MR(ASS) (SEQ ID NO: 18). Sequences underlined in B, C, and D are the control primer sets used to amply bisulfite-converted sequences (sense or antisense) of the duplex unmethylated HLTF DNA (designated as UF or UR): forward PCR primer 1347UF (SEQ ID NO: 13) and reverse primer 1610UR (SEQ ID NO: 14); forward PCR primer 1347UF (SEQ ID NO: 13) and reverse primer 1631UR (SEQ ID NO: 16); forward primer 1349UF(ASS) (SEQ ID NO: 19) and reverse primer 1611UR(ASS) (SEQ ID NO: 20).
[0034]FIG. 21 shows the HLTF 5' genomic sequence (residues 1200-2500, sense strand, SEQ ID NO: 21). The region is differentially methylated as shown in FIG. 10.
[0035]FIG. 22 shows the sequence (SEQ ID NO: 22), derived from bisulfite conversion of DNA derived from the sense strand of methylated template of the HLTF 5' genomic sequence residues 1200-2500 (i.e., SEQ ID NO: 21). The underlined region (residues 1250-1800) was tested by MSP assay. The start ATG is in bold.
[0036]FIG. 23 shows the sequence (SEQ ID NO: 23), derived from bisulfite conversion of DNA derived from the sense strand of unmethylated template of the HLTF 5' genomic sequence residues 1200-2500 (i.e., SEQ ID NO: 21). The underlined region (residues 1250-1800) was tested by MSP assay. The start ATG is in bold.
[0037]FIG. 24 shows the sequence (SEQ ID NO: 24), derived from bisulfite conversion of DNA derived from the antisense strand of methylated template of the HLTF 5' genomic sequence residues 1200-2500 (i.e., SEQ ID NO: 21). The underlined region (residues 1250-1800) was tested by MSP assay. The start ATG is in bold.
[0038]FIG. 25 shows the sequence (SEQ ID NO: 25), derived from bisulfite conversion of DNA derived from the antisense strand of unmethylated template of the HLTF 5' genomic sequence residues 1200-2500 (i.e., SEQ ID NO: 21). The underlined region (residues 1250-1800) was tested by MSP assay. The start ATG is in bold.
[0039]FIG. 26 shows diagrammatically the positions of newly designed primer sets 4-7 for detecting differential methylation of the 3' HLTF region that is beyond the previously-tested region.
[0040]FIG. 27 shows the sequences of the proposed primer sets 4-8. MSP4, MSP5, MSP7, and MSP8 are primer sets for amplifying bisulfite-converted antisense sequences of the duplex methylated HLTF DNA, including: forward primer 1P-HLTF1581MF(ASS) (SEQ ID NO: 26) and reverse primer 13P-HLTF1713MR(ASS) (SEQ ID NO: 27); forward primer 1P-HLTF1581MF(ASS) (SEQ ID NO: 26) and reverse primer 5P-HLTF1827MR(ASS) (SEQ ID NO: 30); forward primer 9P-HLTF1893MF(ASS) (SEQ ID NO: 36) and reverse primer ALU(MB)2133FR(ASS) (SEQ ID NO: 37); forward primer 15P-HLTF2201MF(ASS) (SEQ ID NO: 40) and reverse primer 11P-HLTF2400MR(ASS) (SEQ ID NO: 41). MSP6 are primer sets for amplifying bisulfite-converted sense sequences of the duplex methylated HLTF DNA, including forward primer 3P-HLTF1621MF (SEQ ID NO: 32) and reverse primer 7P-HLTF1873MR (SEQ ID NO: 33). Sequences underlined are the control primer sets used to amplify bisulfite-converted sequences (sense or antisense) of the duplex unmethylated HLTF DNA (designated as UF or UR), including: forward primer 2P-HLTF1575UF(ASS) (SEQ ID NO: 28) and reverse primer 14P-HLTF1728UR(ASS) (SEQ ID NO: 29); forward primer 2P-HLTF1575UF(ASS) (SEQ ID NO: 28) and reverse primer 6P-HLTF1829UR(ASS) (SEQ ID NO: 31); forward primer 4P-HLTF1614UF (SEQ ID NO: 34) and reverse primer 8P-HLTF1878UR (SEQ ID NO: 35); forward primer 10P-HLTF1890UF(ASS) (SEQ ID NO: 38) and reverse primer ALU(MB)2133FR(ASS) (SEQ ID NO: 37); forward primer 16P-HLTF2197UF(ASS) (SEQ ID NO: 42) and reverse primer 12P-HLTF2403UR(ASS) (SEQ ID NO: 43).
[0041]FIG. 28 shows a region of the Genomic clone AC021059 (residues 58381-120901) (SEQ ID NO: 39), encompassing the HLTF gene. The HLTF gene is located on the antisense strand of the clone.
DETAILED DESCRIPTION OF THE INVENTION
I. Definitions
[0042]For convenience, certain terms employed in the specification, examples, and appended claims are collected here. Unless defined otherwise, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs.
[0043]The articles "a" and "an" are used herein to refer to one or to more than one (i.e., to at least one) of the grammatical object of the article. By way of example, "an element" means one element or more than one element.
[0044]The terms "adenoma", "colon adenoma," and "polyp" are used herein to describe any precancerous neoplasia of the colon.
[0045]The term "colon" as used herein is intended to encompass the right colon (including the cecum), the transverse colon, the left colon, and the rectum.
[0046]The terms "colorectal cancer" and "colon cancer" are used interchangeably herein to refer to any cancerous neoplasia of the colon (including the rectum, as defined above).
[0047]The term "blood-derived fraction" herein refers to a component or components of whole blood. Whole blood comprises a liquid portion (i.e., plasma) and a solid portion (i.e., blood cells). The liquid and solid portions of blood are each comprised of multiple components; e.g., different proteins in plasma or different cell types in the solid portion. One of these components or a mixture of any of these components is a blood-derived fraction as long as such fraction is missing one or more components found in whole blood.
[0048]"Cells," "host cells" or "recombinant host cells" are terms used interchangeably herein. It is understood that such terms refer not only to the particular subject cell but to the progeny or potential progeny of such a cell. Because certain modifications may occur in succeeding generations due to either mutation or environmental influences, such progeny may not, in fact, be identical to the parent cell, but are still included within the scope of the term as used herein.
[0049]The terms "compound", "test compound," "agent", and "molecule" are used herein interchangeably and are meant to include, but are not limited to, peptides, nucleic acids, carbohydrates, small organic molecules, natural product extract libraries, and any other molecules (including, but not limited to, chemicals, metals, and organometallic compounds).
[0050]The term "compound-converted DNA" herein refers to DNA that has been treated or reacted with a chemical compound that converts unmethylated C bases in DNA to a different nucleotide base. For example, one such compound is sodium bisulfite, which converts unmethylated C to U. If DNA that contains conversion-sensitive cytosine is treated with sodium bisulfite, the compound-converted DNA will contain U in place of C. If the DNA which is treated with sodium bisulfite contains only methylcytosine, the compound-converted DNA will not contain uracil in place of the methylcytosine.
[0051]The term "de-methylating agent" as used herein refers agents that restore activity and/or gene expression of target genes silenced by methylation upon treatment with the agent Examples of such agents include without limitation 5-azacytidine, 5-aza-2'-deoxycytidine,
[0052]As used herein, the phrase "gene expression" or "protein expression" includes any information pertaining to the amount of gene transcript or protein present in a sample, as well as information about the rate at which genes or proteins are produced or are accumulating or being degraded (e.g., reporter gene data, data from nuclear runoff experiments, pulse-chase data etc.). Certain kinds of data might be viewed as relating to both gene and protein expression. For example, protein levels in a cell are reflective of the level of protein as well as the level of transcription, and such data is intended to be included by the phrase "gene or protein expression information." Such information may be given in the form of amounts per cell, amounts relative to a control gene or protein, in unitless measures, etc.; the term "information" is not to be limited to any particular means of representation and is intended to mean any representation that provides relevant information. The term "expression levels" refers to a quantity reflected in or derivable from the gene or protein expression data, whether the data is directed to gene transcript accumulation or protein accumulation or protein synthesis rates, etc.
[0053]The term "detection" is used herein to refer to any process of observing a marker, or a change in a marker (such as for example the change in the methylation state of the marker), in a biological sample, whether or not the marker or the change in the marker is actually detected. In other words, the act of probing a sample for a marker or a change in the marker, is a "detection" even if the marker is determined to be not present or below the level of sensitivity. Detection may be a quantitative, semi-quantitative or non-quantitative observation.
[0054]"differentially methylated HLTF nucleotide sequence" refers to a region of the HLTF nucleotide sequence that is found to be methylated in an HLTF-associated neoplasia such as a region of the HLTF nucleotide sequence that is found to be methylated in colon cancer tissues or cell lines, but not methylated in the normal tissues or cell lines. For example, FIG. 10 delineates certain HLTF regions that are differentially methylated regions. Illustrative examples of such differentially methylated HLTF regions are set forth in SEQ ID NOs: 4 and 21.
[0055]"Expression vector" refers to a replicable DNA construct used to express DNA which encodes the desired protein and which includes a transcriptional unit comprising an assembly of (1) genetic element(s) having a regulatory role in gene expression, for example, promoters, operators, or enhancers, operatively linked to (2) a DNA sequence encoding a desired protein (in this case, an HLTF protein) which is transcribed into mRNA and translated into protein, and (3) appropriate transcription and translation initiation and termination sequences. The choice of promoter and other regulatory elements generally varies according to the intended host cell. In general, expression vectors of utility in recombinant DNA techniques are often in the form of "plasmids" which refer to circular double stranded DNA loops which, in their vector form are not bound to the chromosome. In the present specification, "plasmid" and "vector" are used interchangeably as the plasmid is the most commonly used form of vector. However, the invention is intended to include such other forms of expression vectors which serve equivalent functions and which become known in the art subsequently hereto.
[0056]In the expression vectors, regulatory elements controlling transcription or translation can be generally derived from mammalian, microbial, viral or insect genes. The ability to replicate in a host, usually conferred by an origin of replication, and a selection gene to facilitate recognition of transformants may additionally be incorporated. Vectors derived from viruses, such as retroviruses, adenoviruses, and the like, may be employed.
[0057]The terms "healthy", "normal," and "non-neoplastic" are used interchangeably herein to refer to a subject or particular cell or tissue that is devoid (at least to the limit of detection) of a disease condition, such as a neoplasia, that is associated with HLTF such as for example neoplsia associated with silencing of HLTF gene expression due to methylation. These terms are often used herein in reference to tissues and cells of the colon. Thus, for the purposes of this application, a patient with severe heart disease but lacking a HLTF silencing-associated disease would be termed "healthy."
[0058]"HLTF-associated neoplasia" refers to neoplasia associated with reduced expression or no expression of the HLTF gene. Examples of HLTF-associated neoplasia include gastro-intestinal neoplasia, colon neoplasia etc.
[0059]"HLTF-associated proliferative disorder" refers to a disease that is associated with either reduced expression or over-expression of the HLTF gene.
[0060]"HLTF-methylation target regions" as used herein refer to those regions of HLTF that are found to be methylated. These regions include nucleotide regions that may be either constitutively or differentially methylated regions. For example, FIG. 10 discloses an HLTF region wherein certain regions of the sequence are constitutively methylated and certain other regions are differentially methylated regions. Illustrative examples of such HLTF methylation target regions are set forth in SEQ ID NO: 2-3 and 39.
[0061]"HLTF-nucleotide sequence" or "HLTF-nucleic acid sequence" as used herein refers to the HLTF-genomic sequences as set forth in SEQ ID NO: 39 and to the 5'-genomic flanking regulatory regions as set forth in SED ID NOs: 2-4 and 21.
[0062]"HLTF-silencing associated diseases" as used herein includes HLTF-associated neoplasia.
[0063]"Homology" or "identity" or "similarity" refers to sequence similarity between two peptides or between two nucleic acid molecules. Homology and identity can each be determined by comparing a position in each sequence which may be aligned for purposes of comparison. When an equivalent position in the compared sequences is occupied by the same base or amino acid, then the molecules are identical at that position; when the equivalent site occupied by the same or a similar amino acid residue (e.g., similar in steric and/or electronic nature), then the molecules can be referred to as homologous (similar) at that position. Expression as a percentage of homology/similarity or identity refers to a function of the number of identical or similar amino acids at positions shared by the compared sequences. A sequence which is "unrelated" or "non-homologous" shares less than 40% identity, preferably less than 25% identity with a sequence of the present invention. In comparing two sequences, the absence of residues (amino acids or nucleic acids) or presence of extra residues also decreases the identity and homology/similarity.
[0064]The term "homology" describes a mathematically based comparison of sequence similarities which is used to identify genes or proteins with similar functions or motifs. The nucleic acid and protein sequences of the present invention may be used as a "query sequence" to perform a search against public databases to, for example, identify other family members, related sequences or homologs. Such searches can be performed using the NBLAST and XBLAST programs (version 2.0) of Altschul, et al. (1990) J. Mol. Biol. 215:403-10. BLAST nucleotide searches can be performed with the NBLAST program, score=100, wordlength=12 to obtain nucleotide sequences homologous to nucleic acid molecules of the invention. BLAST protein searches can be performed with the XBLAST program, score=50, wordlength=3 to obtain amino acid sequences homologous to protein molecules of the invention. To obtain gapped alignments for comparison purposes, Gapped BLAST can be utilized as described in Altschul et al., (1997) Nucleic Acids Res. 25(17):3389-3402. When utilizing BLAST and Gapped BLAST programs, the default parameters of the respective programs (e.g., XBLAST and BLAST) can be used. See http://www.ncbi.nlm.nih.gov.
[0065]As used herein, "identity" means the percentage of identical nucleotide or amino acid residues at corresponding positions in two or more sequences when the sequences are aligned to maximize sequence matching, i.e., taking into account gaps and insertions. Identity can be readily calculated by known methods, including but not limited to those described in (Computational Molecular Biology, Lesk, A. M., ed., Oxford University Press, New York, 1988; Biocomputing: Informatics and Genome Projects, Smith, D. W., ed., Academic Press, New York, 1993; Computer Analysis of Sequence Data, Part I, Griffin, A. M., and Griffin, H. G., eds., Humana Press, New Jersey, 1994; Sequence Analysis in Molecular Biology, von Heinje, G., Academic Press, 1987; and Sequence Analysis Primer, Gribskov, M. and Devereux, J., eds., M Stockton Press, New York, 1991; and Carillo, H., and Lipman, D., SIAM J. Applied Math., 48: 1073, 1988). Methods to determine identity are designed to give the largest match between the sequences tested. Moreover, methods to determine identity are codified in publicly available computer programs. Computer program methods to determine identity between two sequences include, but are not limited to, the GCG program package (Devereux, J., et al., Nucleic Acids Research 12(1): 387 (1984)), BLASTP, BLASTN, and FASTA (Altschul, S. F. et al., J. Molec. Biol. 215: 403-410 (1990) and Altschul et al. Nuc. Acids Res. 25: 3389-3402 (1997)). The BLAST X program is publicly available from NCBI and other sources (BLAST Manual, Altschul, S., et al., NCBI NLM NIH Bethesda, Md. 20894; Altschul, S., et al., J. Mol. Biol. 215: 403-410 (1990)). The well known Smith Waterman algorithm may also be used to determine identity.
[0066]The term "including" is used herein to mean, and is used interchangeably with, the phrase "including but not limited to."
[0067]The term "isolated" as used herein with respect to nucleic acids, such as DNA or RNA, refers to molecules in a form which does not occur in nature. Moreover, an "isolated nucleic acid" is meant to include nucleic acid fragments which are not naturally occurring as fragments and would not be found in the natural state.
[0068]The term "methylation-sensitive PCR" (i.e., MSP) herein refers to a polymerase chain reaction in which amplification of the compound-converted template sequence is performed. Two sets of primers are designed for use in MSP. Each set of primers comprises a forward primer and a reverse primer. One set of primers, called methylation-specific primers (see below), will amplify the compound-converted template sequence if C bases in CpG dinucleotides within the HLTF DNA are methylated. Another set of primers, called unmethylation-specific primers (see below), will amplify the compound-converted template sequences if C bases in CpG dinucleotides within the HLTF DNA are not methylated.
[0069]As used herein, the term "nucleic acid" refers to polynucleotides such as deoxyribonucleic acid (DNA), and, where appropriate, ribonucleic acid (RNA). The term should also be understood to include, as equivalents, analogs of either RNA or DNA made from nucleotide analogs, and, as applicable to the embodiment being described, single-stranded (such as sense or antisense) and double-stranded polynucleotides.
[0070]"Operably linked" when describing the relationship between two DNA regions simply means that they are functionally related to each other. For example, a promoter or other transcriptional regulatory sequence is operably linked to a coding sequence if it controls the transcription of the coding sequence.
[0071]The term "or" is used herein to mean, and is used interchangeably with, the term "and/or", unless context clearly indicates otherwise.
[0072]The terms "proteins" and "polypeptides" are used interchangeably herein.
[0073]A "sample" includes any material that is obtained or prepared for detection of a molecular marker or a change in a molecular marker such as for example the methylation state, or any material that is contacted with a detection reagent or detection device for the purpose of detecting a molecular marker or a change in the molecular marker.
[0074]A "subject" is any organism of interest, generally a mammalian subject, such as a mouse, and preferably a human subject.
[0075]As used herein, the term "specifically hybridizes" refers to the ability of a nucleic acid probe/primer of the invention to hybridize to at least 12, 15, 20, 25, 30, 35, 40, 45, 50 or 100 consecutive nucleotides of a target sequence, or a sequence complementary thereto, or naturally occurring mutants thereof, such that it has less than 15%, preferably less than 10%, and more preferably less than 5% background hybridization to a cellular nucleic acid (e.g., mRNA or genomic DNA) other than the target gene. A variety of hybridization conditions may be used to detect specific hybridization, and the stringency is determined primarily by the wash stage of the hybridization assay. Generally high temperatures and low salt concentrations give high stringency, while low temperatures and high salt concentrations give low stringency. Low stringency hybridization is achieved by washing in, for example, about 2.0×SSC at 50° C., and high stringency is achieved with about 0.2×SSC at 50° C. Further descriptions of stringency are provided below.
[0076]As applied to polypeptides, "substantial sequence identity" means that two peptide sequences, when optimally aligned, such as by the programs GAP or BESTFIT using default gap which share at least 90 percent sequence identity, preferably at least 95 percent sequence identity, more preferably at least 99 percent sequence identity or more. Preferably, residue positions which are not identical differ by conservative amino acid substitutions. For example, the substitution of amino acids having similar chemical properties such as charge or polarity are not likely to effect the properties of a protein. Examples include glutamine for asparagine or glutamic acid for aspartic acid.
[0077]As used herein, the term "transgene" means a nucleic acid sequence (encoding, e.g., an HLTF polypeptide), which is partly or entirely heterologous (i.e., foreign) to the transgenic animal or cell into which it is introduced, or, is homologous to an endogenous gene of the transgenic animal or cell into which it is introduced, but which is designed to be inserted, or is inserted, into the animal's genome in such a way as to alter the genome of the cell into which it is inserted (e.g., it is inserted at a location which differs from that of the natural gene or its insertion results in a knockout). An HLTF transgene can include one or more transcriptional regulatory sequences and any other nucleic acid, such as introns, that may be necessary for optimal expression of a selected nucleic acid. An HLTF transgene can include a HLTF nucleotide sequence (SEQ ID NO: 3) or fragments thereof.
II. Overview
[0078]In certain aspects, the invention relates to methods for determining whether a patient is likely or unlikely to have a colon neoplasia. A colon neoplasia is any cancerous or precancerous growth located in, or derived from, the colon. The colon is a portion of the intestinal tract that is roughly three feet in length, stretching from the end of the small intestine to the rectum. Viewed in cross section, the colon consists of four distinguishable layers arranged in concentric rings surrounding an interior space, termed the lumen, through which digested materials pass. In order, moving outward from the lumen, the layers are termed the mucosa, the submucosa, the muscularis propria and the subserosa. The mucosa includes the epithelial layer (cells adjacent to the lumen), the basement membrane, the lamina propria and the muscularis mucosae. In general, the "wall" of the colon is intended to refer to the submucosa and the layers outside of the submucosa. The "lining" is the mucosa.
[0079]Precancerous colon neoplasias are referred to as adenomas or adenomatous polyps. Adenomas are typically small mushroom-like or wart-like growths on the lining of the colon and do not invade into the wall of the colon. Adenomas may be visualized through a device such as a colonoscope or flexible sigmoidoscope. Several studies have shown that patients who undergo screening for and removal of adenomas have a decreased rate of mortality from colon cancer. For this and other reasons, it is generally accepted that adenomas are an obligate precursor for the vast majority of colon cancers.
[0080]When a colon neoplasia invades into the basement membrane of the colon, it is considered a colon cancer, as the term "colon cancer" is used herein. In describing colon cancers, this specification will generally follow the so-called "Dukes" colon cancer staging system. The characteristics that the describe a cancer are generally of greater significance than the particular term used to describe a recognizable stage. The most widely used staging systems generally use at least one of the following characteristics for staging: the extent of tumor penetration into the colon wall, with greater penetration generally correlating with a more dangerous tumor; the extent of invasion of the tumor through the colon wall and into other neighboring tissues, with greater invasion generally correlating with a more dangerous tumor; the extent of invasion of the tumor into the regional lymph nodes, with greater invasion generally correlating with a more dangerous tumor; and the extent of metastatic invasion into more distant tissues, such as the liver, with greater metastatic invasion generally correlating with a more dangerous disease state.
[0081]"Dukes A" and "Dukes B" colon cancers are neoplasias that have invaded into the wall of the colon but have not spread into other tissues. Dukes A colon cancers are cancers that have not invaded beyond the submucosa. Dukes B colon cancers are subdivided into two groups: Dukes B1 and Dukes B2. "Dukes B1" colon cancers are neoplasias that have invaded upto but not through the muscularis propria. Dukes B2 colon cancers are cancers that have breached completely through the muscularis propria. Over a five year period, patients with Dukes A cancer who receive surgical treatment (i.e. removal of the affected tissue) have a greater than 90% survival rate. Over the same period, patients with Dukes B1 and Dukes B2 cancer receiving surgical treatment have a survival rate of about 85% and 75%, respectively. Dukes A, B1 and B2 cancers are also referred to as T1, T2 and T3-T4 cancers, respectively.
[0082]"Dukes C" colon cancers are cancers that have spread to the regional lymph nodes, such as the lymph nodes of the gut. Patients with Dukes C cancer who receive surgical treatment alone have a 35% survival rate over a five year period, but this survival rate is increased to 60% in patients that receive chemotherapy.
[0083]"Dukes D" colon cancers are cancers that have metastasized to other organs. The liver is the most common organ in which metastatic colon cancer is found. Patients with Dukes D colon cancer have a survival rate of less than 5% over a five year period, regardless of the treatment regimen.
[0084]In general, colon neoplasia develops through one of at least three different pathways, termed chromosomal instability, microsatellite instability, and the CpG island methylator phenotype (CIMP). Although there is some overlap, these pathways tend to present somewhat different biological behavior. By understanding the pathway of tumor development, the target genes involved, and the mechanisms underlying the genetic instability, it is possible to implement strategies to detect and treat the different types of colon neoplasias.
[0085]This application is based at least in part, on the recognition that certain target genes may be silenced or inactivated by the differential methylation of CpG islands in the 5'flanking or promoter regions of the target gene. CpG islands are clusters of cytosine-guanosine residues in a DNA sequence, that are prominently represented in the 5-flanking region or promoter region of about half the genes in our genome. In particular, this application is based at least in part on the recognition that differential methylation of the HLTF nucleotide sequence may be indicative of colon neoplasia. In one aspect, this application discloses that the HLTF gene can be a common target for methylation and epigenetic gene silencing in cancer cells (e.g., a colon neoplasia), and function as a candidate tumor suppressor gene.
[0086]HLTF (helicase-like transcription factor, also called HIP116a, Zbu1, RUSH1a and Smarca3) is a member of the SWI/SNF family. The SWI/SNF family of genes encode members of multiprotein complexes that utilize the energy of ATP hydrolysis to alter nucleosome position or spacing (Muchardt, et al., 1999, J. Mol. Biol., 293:187-198; Sudarsanam, et al., 2000, Trends Genet., 16:345-351). HLTF has 5'-sequence-specific DNA-binding domains and can thus be targeted to specific promoters directly. For example, HLTF protein can bind to a promoter element (i.e., the B Box) of the plasminogen activator inhibitor-1 (PAI-1) gene and induce PAI-1 gene expression (Ding, et al., 1996, DNA Cell Biol. 15:429-442; Zhang, et al., 1997, Gene, 202:31-7). Functional interactions between Sp1 or Sp3 and HLTF were found to mediate basal expression from the PAI-1 gene (Ding, et al., 1999, J. Biol. Chem., 274:19573-19580). Recently, it has been found that HLTF is an activator of beta-globin transcription (Mahajan, et al., 2002, Blood, 99:348-56).
[0087]As noted above, early detection of colon neoplasia, coupled with appropriate intervention, is important for increasing patient survival rates. Present systems for screening for colon neoplasia are deficient for a variety of reasons, including a lack of specificity and/or sensitivity (e.g. Fecal Occult Blood Test, flexible sigmoidoscopy) or a high cost and intensive use of medical resources (e.g. colonoscopy). Alternative systems for detection of colon neoplasia would be useful in a wide range of other clinical circumstances as well. For example, patients who receive surgical and/or pharmaceutical therapy for colon cancer may experience a relapse. It would be advantageous to have an alternative system for determining whether such patients have a recurrent or relapsed colon neoplasia. As a further example, an alternative diagnostic system would facilitate monitoring an increase, decrease or persistence of colon neoplasia in a patient known to have a colon neoplasia. A patient undergoing chemotherapy may be monitored to assess the effectiveness of the therapy.
III. HLTF Nucleic Acids Polypeptides, and Antibodies
[0088]The present invention is based, at least in part, on the observation that HLTF nucleotide sequences are differentially methylated in certain HLTF-associated neoplasia, such as colon neoplasia. In one aspect, the application discloses HLTF nucleotide sequences having certain regions that are differentially methylated in HLTF-associated neoplasia as set forth in SEQ ID NOs: 2-4, 21, and 39. In other embodiments, the application provides nucleotide sequences that are differentially methylated in HLTF-associated neoplasia as set forth in SEQ ID NOs: 4 and 21. Accordingly, in one embodiment the application provides isolated or recombinant nucleotide sequences that are at least 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% identical to a nucleic acid sequence as set forth in SEQ ID NOs: 2-4, 21, and 39. In yet other aspects, the application provides oligonucleotide sequences having at least 50, 75, or 100 consecutive base pairs of any one of the sequences as set forth in SEQ ID NOs: 2-4 and 21.
[0089]In certain alternative embodiments, the application provides the differentially methylated HLTF nucleotide sequence set forth in SEQ ID NOs: 4 and 21 and fragments thereof, wherein detection of methylation in any one of said fragments would be indicative of an HLTF-associated neoplasia such as colon neoplasia. One of ordinary skill in the art will appreciate that HLTF nucleic acid sequences complementary to SEQ ID NOs: 2-4 and 21, variants of SEQ ID NOs: 2-4 and 21 are also within the scope of this invention. Such variant nucleotide sequences include sequences that differ by one or more nucleotide substitutions, additions or deletions, such as for example allelic variants.
[0090]In yet other embodiments, HLTF nucleotide sequences also include nucleotide sequences sequences that will hybridize under highly stringent conditions to nucleotide sequences designated in SEQ ID NOs: 2-4 and 21. As discussed above, one of ordinary skill in the art will understand readily that appropriate stringency conditions which promote DNA hybridization can be varied. One of ordinary skill in the art will understand readily that appropriate stringency conditions which promote DNA hybridization can be varied. For example, one could perform the hybridization at 6.0× sodium chloride/sodium citrate (SSC) at about 45° C., followed by a wash of 2.0×SSC at 50° C. For example, the salt concentration in the wash step can be selected from a low stringency of about 2.0×SSC at 50° C. to a high stringency of about 0.2×SSC at 50° C. In addition, the temperature in the wash step can be increased from low stringency conditions at room temperature, about 22° C., to high stringency conditions at about 65° C. Both temperature and salt may be varied, or temperature or salt concentration may be held constant while the other variable is changed. In one embodiment, the invention provides nucleic acids which hybridize under low stringency conditions of 6×SSC at room temperature followed by a wash at 2×SSC at room temperature.
[0091]In yet another aspect, the application provides the methylated forms of nucleotide sequences as set forth in SEQ ID NOs: 4 and 21, wherein the cytosine bases of the CpG islands present in said sequences are methylated. In other words, the HLTF nucleotide sequences may be either in the methylated status (e.g., as seen in HLTF-associated neoplasias) or in the unmethylated status (e.g., as seen in normal cells). In further embodiments, the HLTF nucleotide sequences of the invention can be isolated, recombinant, and/or fused with a heterologous nucleotide sequence, or in a DNA library.
[0092]In addition to the differentially methylated HLTF nucleotide sequences, the present application discloses constitutively methylated HLTF nucleotide sequences (such as the Alu repeats and the "non-Alu constitutively methylated region" as set forth in FIG. 10). Since such constitutively methylated HLTF nucleotide sequences are methylated in both normal cells and cancer cells, a person skilled in the art would appreciate the significance of detecting the differentially methylated HLTF nucleotide sequences as provided herein. Furthermore, although one of ordinary skill would expect the Alu's to be methylated, the finding of non-Alu constitutively methylated regions shows that one could not have computationally predicted the presence of the differentially methylated regions and their significance in the detection of HLTF-associated neoplasia.
[0093]In certain embodiments, the application contemplates any HLTF nucleotide sequence within the HLTF genomic sequence, SEQ ID NO: 39 (see FIG. 28) that is differentially methylated in HLTF-associated neoplasia cells, but not in normal cells. Thus, assaying of the methylation status of such an HLTF nucleotide sequence can differentiate cancer cells from normal cells.
[0094]In certain embodiments, the present invention provides bisulfite-converted HLTF template DNA sequences as set forth in SEQ ID NOs: 5-8 and 22-25. Such bisulfite-converted HLTF template DNA can be used for detecting the methylation status, for example, by an MSP reaction or by direct sequencing. In yet other embodiments, the bisulfite-converted HLTF nucleotide sequences of the invention also include nucleotide sequences that will hybridize under highly stringent conditions to any nucleotide sequence selected from SEQ ID NOs: 5-8 and 22-25.
[0095]In further aspects, the application provides methods for producing such bisulfite-converted nucleotide sequences, for example, the application provides methods for treating a nucleotide sequence with a bisulfite agent such that the unmethylated cytosine bases are converted to a different nucleotide base such as a uracil.
[0096]In yet other aspects, the application provides oligonucleotide primers for amplifying a region within the HLTF nucleic acid sequence of any one of SEQ ID NOs: 2-4 and 21. In certain aspects, a pair of the oligonucleotide primers (for example, SEQ ID NOs: 9-10) can be used in a detection assay, such as the HpaII assay. In certain aspects, primers used in an MSP reaction can specifically distinguish between methylated and non-methylated HLTF DNA, for example, SEQ ID NOs: 11-20, 26-38, and 40-43).
[0097]The primers of the invention have sufficient length and appropriate sequence so as to provide specific initiation of amplification of HLTF nucleic acids. Primers of the invention are designed to be "substantially" complementary to each strand of the HLTF nucleic acid sequence to be amplified. While exemplary primers are provided in SEQ ID NOs: 11-20, 26-38, and 40-43, it is understood that any primer that hybridizes with the bisulfite-converted HLTF sequence of any one of SEQ ID NOs: 2-4 and 21 are included within the scope of this invention and is useful in the method of the invention for detecting methylated nucleic acid, as described. Similarly, it is understood that any primers that would serve to amplify a methylation sensitive restriction site or sites within the differentially methylated region of SEQ ID NOs: 2-4 or 21 are included within the scope of this invention and is useful in the method of the invention for detecting nucleic methylated nucleic acid, as described.
[0098]The oligonucleotide primers of the invention may be prepared by using any suitable method, such as conventional phosphotriester and phosphodiester methods or automated embodiments thereof. In one such automated embodiment, diethylphosphoramidites are used as starting materials and may be synthesized as described by Beaucage, et al. (Tetrahedron Letters, 22:1859-1862, 1981). One method for synthesizing oligonucleotides on a modified solid support is described in U.S. Pat. No. 4,458,066.
[0099]The various Sequence Identification Numbers that have been used in this application are summarized below:
TABLE-US-00001 TABLE I SEQ Corresponding ID NO Description/Name Figure 1 amino acid sequence of human HLTF protein. FIG. 14. 2 5' genomic sequence of human HLTF gene, FIG. 15. residues 1-3000. 3 5' genomic sequence of human HLTF gene, FIG. 16. residues 600-2600. 4 5' genomic sequence of human HLTF gene, FIG. 17. residues 1250-1800. 5 methylated SEQ ID NO: 4, after bisulfite FIG. 18, conversion. top panel. 6 unmethylated SEQ ID NO: 4, after bisulfite FIG. 18, conversion. bottom panel. 7 complementary strand of SEQ ID NO: 5. FIG. 19, top panel. 8 complementary strand of SEQ ID NO: 6. FIG. 19, bottom panel. 9 P-HLTF1277F FIG. 20. 10 P-HLTF1724R FIG. 20. 11 P-HLTF1352MF FIG. 20. 12 P-HLTF1606MR FIG. 20. 13 P-HLTF1347UF FIG. 20. 14 P-HLTF1610UR FIG. 20. 15 P-HLTF1627MR FIG. 20. 16 P-HLTF1631UR FIG. 20. 17 P-HLTF1352MF(ASS) FIG. 20. 18 P-HLTF1607MR(ASS) FIG. 20. 19 P-HLTF1349UF(ASS) FIG. 20. 20 P-HLTF1611UR(ASS) FIG. 20. 21 5' genomic sequence of human HLTF gene, FIG. 21. residues 1200-2500, sense strand. 22 methylated SEQ ID NO: 21, after bisulfite FIG. 22. conversion. 23 unmethylated SEQ ID NO: 21, after bisulfite FIG. 23. conversion. 24 methylated antisense-strand of SEQ ID NO: 21, FIG. 24. after bisulfite conversion. 25 unmethylated antisense-strand of SEQ ID NO: FIG. 25. 21, after bisulfite conversion. 26 1P-HLTF1581MF(ASS) FIG. 27. 27 13P-HLTF1713MR(ASS) FIG. 27. 28 2P-HLTF1575UF(ASS) FIG. 27. 29 14P-HLTF1728UR(ASS) FIG. 27. 30 5P-HLTF1827MR(ASS) FIG. 27. 31 6P-HLTF1829UR(ASS) FIG. 27. 32 3P-HLTF1621MF FIG. 27. 33 7P-HLTF1873MR FIG. 27. 34 4P-HLTF1614UF FIG. 27. 35 8P-HLTF1878UR FIG. 27. 36 9P-HLTF1893MF(ASS) FIG. 27. 37 ALU(MB)2133FR(ASS) FIG. 27. 38 10P-HLTF1890UF(ASS) FIG. 27. 39 HLTF genomic sequence (GenBank accession FIG. 28. No. NT_005616, complementary residues 572873-629300) 40 15P-HLTF2201MF(ASS) FIG. 27 41 11P-HLTF2400MR(ASS) FIG. 27 42 16P-HLTF2197UF(ASS) FIG. 27 43 12P-HLTF2403UR(ASS) FIG. 27
[0100]In certain other aspects, the invention relates to HLTF nucleic acids that encode the HLTF polypeptide of SEQ ID NO: 1 and variants thereof. Variant include sequences that differ by one or more nucleotide substitutions, additions or deletions, such as allelic variants; and will, therefore, include coding sequences that differ from the nucleotide sequence of the coding sequence e.g., due to the degeneracy of the genetic code. In certain embodiments, variant nucleic acids will also include sequences that will hybridize under highly stringent conditions to a nucleotide sequence encoding SEQ ID NO: 1.
[0101]Isolated HLTF nucleic acids which differ from the nucleic acids encoding SEQ ID NO: 1 due to degeneracy in the genetic code are also within the scope of the invention. For example, a number of amino acids are designated by more than one triplet. Codons that specify the same amino acid, or synonyms (for example, CAU and CAC are synonyms for histidine) may result in "silent" mutations which do not affect the amino acid sequence of the protein. However, it is expected that DNA sequence polymorphisms that do lead to changes in the amino acid sequences of the subject proteins will exist among mammalian cells. One skilled in the art will appreciate that these variations in one or more nucleotides (up to about 3-5% of the nucleotides) of the nucleic acids encoding a particular protein may exist among individuals of a given species due to natural allelic variation. Any and all such nucleotide variations and resulting amino acid polymorphisms are within the scope of this invention.
[0102]In certain embodiments, the recombinant HLTF nucleic acid may be operably linked to one or more regulatory nucleotide sequences in an expression construct. Regulatory nucleotide sequences will generally be appropriate for a host cell used for expression. Numerous types of appropriate expression vectors and suitable regulatory sequences are known in the art for a variety of host cells. Typically, said one or more regulatory nucleotide sequences may include, but are not limited to, promoter sequences, leader or signal sequences, ribosomal binding sites, transcriptional start and termination sequences, translational start and termination sequences, and enhancer or activator sequences. Constitutive or inducible promoters as known in the art are contemplated by the invention. The promoters may be either naturally occurring promoters, or hybrid promoters that combine elements of more than one promoter. An expression construct may be present in a cell on an episome, such as a plasmid, or the expression construct may be inserted in a chromosome. In a preferred embodiment, the expression vector contains a selectable marker gene to allow the selection of transformed host cells. Selectable marker genes are well known in the art and will vary with the host cell used.
[0103]In certain aspects, the invention relates to HLTF polypeptide (SEQ ID NO: 1) described herein, and variants polypeptides thereof. In certain embodiments, variant polypeptides have an amino acid sequence that is at least 75% identical to an amino acid sequence as set forth in SEQ ID NO: 1. In other embodiments, the variant polypeptide has an amino acid sequence at least 80%, 85%, 90%, 95%, 97%, 98%, 99% or 100% identical to an amino acid sequence as set forth in SEQ ID NO: 1.
[0104]In certain aspects, variant HLTF polypeptides are agonists or antagonists of the HLTF polypeptide as set forth in SEQ ID NO: 1. Variants of these polypeptides may have a hyperactive or constitutive activity, or, alternatively, act to prevent the tumor suppressor activity of HLTF. For example, a truncated form lacking one or more domain may have a dominant negative effect.
[0105]In certain aspects, isolated peptidyl portions of the HLTF polypeptide can be obtained by screening polypeptides recombinantly produced from the corresponding fragment of the nucleic acid encoding the polypeptide as set forth in SEQ ID NO: 1. In addition, fragments can be chemically synthesized using techniques known in the art such as conventional Merrifield solid phase f-Moc or t-Boc chemistry. The fragments can be produced (recombinantly or by chemical synthesis) and tested to identify those peptidyl fragments which can function as either agonists or antagonists of the tumor suppressor function of HLTF.
[0106]In certain aspects, variant HLTF polypeptides containing one or more fusion domains. Well known examples of such fusion domains include, for example, polyhistidine, Glu-Glu, glutathione S transferase (GST), thioredoxin, protein A, protein G, and an immunoglobulin heavy chain constant region (Fc), maltose binding protein (MBP), which are particularly useful for isolation of the fusion polypeptide by affinity chromatography. For the purpose of affinity purification, relevant matrices for affinity chromatography, such as glutathione-, amylase-, and nickel- or cobalt-conjugated resins are used. Many of such matrices are available in "kit" form, such as the Pharmacia GST purification system and the QIAexpress® system (Qiagen) useful with (HIS6) fusion partners. Another fusion domain well known in the art is green fluorescent protein (GFP). This fusion partner serves as a fluorescent "tag" which allows the fusion polypeptide of the invention to be identified by fluorescence microscopy or by flow cytometry. The GFP tag is useful when assessing subcellular localization of the fusion HLTF polypeptide. The GFP tag is also useful for isolating cells which express the fusion HLTF polypeptide by flow cytometric methods such a fluorescence activated cell sorting (FACS). Fusion domains also include "epitope tags," which are usually short peptide sequences for which a specific antibody is available. Well known epitope tags for which specific monoclonal antibodies are readily available include FLAG, influenza virus haemagglutinin (HA), and c-myc tags. In some cases, the fusion domains have a protease cleavage site, such as for Factor Xa or Thrombin, which allow the relevant protease to partially digest the fusion HLTF polypeptide and thereby liberate the recombinant polypeptide therefrom. The liberated polypeptide can then be isolated from the fusion partner by subsequent chromatographic separation.
[0107]Another aspect of the invention pertains to an isolated antibody specifically immunoreactive with an epitope of an HLTF polypeptide. For example, by using immunogens derived from an HLTF polypeptide (e.g., based on its cDNA sequences), anti-protein/anti-peptide antisera or monoclonal antibodies can be made by standard protocols (See, for example, Antibodies: A Laboratory Manual ed. by Harlow and Lane (Cold Spring Harbor Press: 1988)). A mammal, such as a mouse, a hamster or rabbit can be immunized with an immunogenic form of the HLTF peptide. Techniques for conferring immunogenicity on a protein or peptide include conjugation to carriers or other techniques well known in the art. An immunogenic portion of a polypeptide can be administered in the presence of adjuvant. The progress of immunization can be monitored by detection of antibody titers in plasma or serum. Standard ELISA or other immunoassays can be used with the immunogen as antigen to assess the levels of antibodies.
[0108]In certain embodiment, antibodies of the invention may be useful as diagnostic or therapeutic agents for detecting or treating HLTF-associated diseases.
[0109]The term "antibody" as used herein is intended to include fragments thereof which are also specifically reactive with one of the HLTF polypeptide. Antibodies can be fragmented using conventional techniques and the fragments screened for utility in the same manner as described above for whole antibodies. For example, F(ab)2 fragments can be generated by treating antibody with pepsin. The resulting F(ab)2 fragments can be treated to reduce disulfide bridges to produce Fab fragments. The antibody of the invention is further intended to include bispecific, single-chain, and chimeric and humanized molecules having affinity for the HLTF protein. In preferred embodiments, the antibody further comprises a label attached thereto and able to be detected, (e.g., the label can be a radioisotope, fluorescent compound, enzyme or enzyme co-factor).
IV. Assays and Drug Screening Methodologies
[0110]In certain aspects, the application provides assays and methods using the HLTF nucleotide sequences as molecular markers that distinguish between healthy cells and HLTF-associated diseased cells. For example, in one embodiment, the application provides methods and assays using the HLTF nucleotide sequences as markers that distinguish between healthy cells ands colon neoplasia cells. In one aspect, a molecular marker of the invention is a differentially methylated HLTF nucleotide sequence. In another aspect, another marker provided herein is the HLTF gene expression product.
[0111]In certain embodiments, the invention provides assays for detecting differentially methylated HLTF nucleotide sequences, such as the differential methylation patterns seen in any one of SEQ ID NOs: 2-4, 21 and 39, preferably, SED ID NOs: 4 and 21. Thus, a differentially methylated HLTF nucleotide sequence, in its methylated state, can be a HLTF-associated neoplasia-specific modification that serves as a target for detection using various methods described herein and the methods that are well within the purview of the skilled artisan in view of the teachings of this application.
[0112]In certain embodiments, methods of the present invention assaying for the methylation status of the HLTF nucleotide sequence in combination with one or more genes selected from HIC-1 (hypermethylated in cancer-1), p16, p14, TIMP-3, APC, PTEN, RARβ (retinoic acid receptor β), THBS1, hMLH1, and others. The present application provides that HLTF methylation correlates strongly with a pathway termed as the CpG island methylator phenotype (CIMP+), which may involve methylation of multiple genes, including p16, p14, HIC-1, TIMP-3, APC, PTEN, RARβ, THBS1, and hMLH1.
[0113]In certain aspects, such methods for detecting methylated HLTF nucleotide sequences are based on treatment of HLTF genomic DNA with a chemical compound which converts non-methylated C, but not methylated C (i.e., 5 mC), to a different nucleotide base. One such compound is sodium bisulfite, which converts C, but not 5mC, to U. Methods for bisulfite treatment of DNA are known in the art (Herman, et al., 1996, Proc Natl Acad Sci USA, 93:9821-6; Herman and Baylin, 1998, Current Protocols in Human Genetics, N. E. A. Dracopoli, ed., John Wiley & Sons, 2:10.6.1-10.6.10; U.S. Pat. No. 5,786,146). To illustrate, when an DNA molecule that contains unmethylated C nucleotides is treated with sodium bisulfite to become a compound-converted DNA, the sequence of that DNA is changed (CPU). Detection of the U in the converted nucleotide sequence is indicative of an unmethylated C.
[0114]The different nucleotide base (e.g., U) present in compound-converted nucleotide sequences can subsequently be detected in a variety of ways. In a preferred embodiment, the present invention provides a method of detecting U in compound-converted HLTF DNA sequences by using "methylation sensitive PCR" (MSP) (see, e.g., Herman, et al., 1996, Proc. Natl. Acad. Sci. USA, 93:9821-9826; U.S. Pat. No. 6,265,171; U.S. Pat. No. 6,017,704; U.S. Pat. No. 6,200,756). In MSP, one set of primers (i.e., comprising a forward and a reverse primer) amplifies the compound-converted template sequence if C bases in CpG dinucleotides within the HLTF DNA are methylated. This set of primers is called "methylation-specific primers." Another set of primers amplifies the compound-converted template sequence if C bases in CpG dinucleotides within the HLTF 5' flanking sequence are not methylated. This set of primers is called "unmethylation-specific primers."
[0115]In methyl specific PCR the reactions use the compound-converted DNA from a sample in a subject. In assay for HLTF methylated DNA, methylation-specific primers are used. In the case where C within CpG dinucleotides of the target sequence of the DNA are methylated, the methylation-specific primers will amplify the compound-converted template sequence in the presence of a polymerase and an MSP product will be produced. If C within CpG dinucleotides of the target sequence of the DNA are not methylated, the methylation-specific primers will not amplify the compound-converted template sequence in the presence of a polymerase and an MSP product will not be produced
[0116]It is often also useful to run a control reaction for the detection of unmethylated HLTF DNA. The reactions uses the compound-converted DNA from a sample in a subject and unmethylation-specific primers are used. In the case where C within CpG dinucleotides of the target sequence of the DNA are unmethylated, the unmethylation specific primers will amplify the compound-converted template sequence in the presence of a polymerase and an MSP product will be produced. If C within CpG dinucleotides of the target sequence of the DNA are methylated, the unmethylation-specific primers will not amplify the compound-converted template sequence in the presence of a polymerase and an MSP product will not be produced. Note that a biologic sample will often contain a mixture of both neoplastic cells that give rise to a signal with methylation specific primers, and normal cellular elements that give rise to a signal with unmethylation-specific primers. The unmethyl specific signal is often of use as a control reaction, but does not in this instance imply the absence of colon neoplasia as indicated by the positive signal derived from reactions using the methylation specific primers.
[0117]Primers for an MSP reaction are derived from the compound-converted HLTF template sequence. Herein, "derived from" means that the sequences of the primers are chosen such that the primers amplify the compound-converted template sequence in an MSP reaction. Each primer comprises a single-stranded DNA fragment which is at least 8 nucleotides in length. Preferably, the primers are less than 50 nucleotides in length, more preferably from 15 to 35 nucleotides in length. Because the compound-converted HLTF template sequence can be either the Watson strand or the Crick strand of the double-stranded DNA that is treated with sodium bisulfite, the sequences of the primers is dependent upon whether the Watson or Crick compound-converted template sequence is chosen to be amplified in the MSP. Either the Watson or Crick strand can be chosen to be amplified.
[0118]The compound-converted HLTF template sequence, and therefore the product of the MSP reaction, can be between 20 to 3000 nucleotides in length, preferably between 50 to 500 nucleotides in length, more preferably between 80 to 150 nucleotides in length. Preferably, the methylation-specific primers result in an MSP product of a different length than the MSP product produced by the unmethylation-specific primers.
[0119]A variety of methods can be used to determine if an MSP product has been produced in a reaction assay. One way to determine if an MSP product has been produced in the reaction is to analyze a portion of the reaction by agarose gel electrophoresis. For example, a horizontal agarose gel of from 0.6 to 2.0% agarose is made and a portion of the MSP reaction mixture is electrophoresed through the agarose gel. After electrophoresis, the agarose gel is stained with ethidium bromide. MSP products are visible when the gel is viewed during illumination with ultraviolet light. By comparison to standardized size markers, it is determined if the MSP product is of the correct expected size.
[0120]Other methods can be used to determine whether a product is made in an MSP reaction. One such method is called "real-time PCR." Real-time PCR utilizes a thermal cycler (i.e., an instrument that provides the temperature changes necessary for the PCR reaction to occur) that incorporates a fluorimeter (i.e. an instrument that measures fluorescence). The real-time PCR reaction mixture also contains a reagent whose incorporation into a product can be quantified and whose quantification is indicative of copy number of that sequence in the template. One such reagent is a fluorescent dye, called SYBR Green I (Molecular Probes, Inc.; Eugene, Oreg.) that preferentially binds double-stranded DNA and whose fluorescence is greatly enhanced by binding of double-stranded DNA. When a PCR reaction is performed in the presence of SYBR Green I, resulting DNA products bind SYBR Green I and fluorescence. The fluorescence is detected and quantified by the fluorimeter. Such technique is particularly useful for quantification of the amount of the product in the PCR reaction. Additionally, the product from the PCR reaction may be quantitated in "real-time PCR" by the use of a variety of probes that hybridize to the product including TaqMan probes and molecular beacons. Quantitation may be on an absolute basis, or may be relative to a constitutively methylated DNA standard, or may be relative to an unmethyled DNA standard. In one instance the ratio of methylated HLTF derived product to unmethylated derived HLTF product may be constructed.
[0121]Methods for detecting methylation of the HLTF DNA in this invention are not limited to MSP, and may cover any assay for detecting DNA methylation. Another example method for detecting methylation of the HLTF DNA is by using "methylation-sensitive" restriction endonucleases. Such methods comprise treating the genomic DNA isolated from a subject with an methylation-sensitive restriction endonuclease and then using the restriction endonuclease-treated DNA as a template in a PCR reaction. Herein, methylation-sensitive restriction endonucleases recognize and cleave a specific sequence within the DNA if C bases within the recognition sequence are not methylated. If C bases within the recognition sequence of the restriction endonuclease are methylated, the DNA will not be cleaved. Examples of such methylation-sensitive restriction endonucleases include, but are not limited to HpaII, SmaI, SacII, EagI, MspI, BstUI, and BssHII. In this technique, a recognition sequence for a methylation-sensitive restriction endonuclease is located within the template DNA, at a position between the forward and reverse primers used for the PCR reaction. In the case that a C base within the methylation-sensitive restriction endonuclease recognition sequence is not methylated, the endonuclease will cleave the DNA template and a PCR product will not be formed when the DNA is used as a template in the PCR reaction. In the case that a C base within the methylation-sensitive restriction endonuclease recognition sequence is methylated, the endonuclease will not cleave the DNA template and a PCR product will be formed when the DNA is used as a template in the PCR reaction. Therefore, methylation of C bases can be determined by the absence or presence of a PCR product (Kane, et al., 1997, Cancer Res, 57:808-11). No sodium bisulfite is used in this technique.
[0122]Yet another exemplary method for detecting methylation of the HLTF DNA is called the modified MSP, which method utilizes primers that are designed and chosen such that products of the MSP reaction are susceptible to digestion by restriction endonucleases, depending upon whether the compound-converted template sequence contains CpG dinucleotides or UpG dinucleotides.
[0123]Yet other methods for detecting methylation of the HLTF DNA include the MS-SnuPE methods. This method uses compound-converted HLTF DNA as a template in a primer extension reaction wherein the primers used produce a product, dependent upon whether the compound-converted template contains CpG dinucleotides or UpG dinucleotides (see e.g., Gonzalgo, et al., 1997, Nucleic Acids Res., 25:2529-31).
[0124]Another exemplary method for detecting methylation of the HLTF DNA is called COBRA (i.e., combined bisulfite restriction analysis). This method has been routinely used for DNA methylation detection and is well known in the art (see, e.g., Xiong, et al., 1997, Nucleic Acids Res, 25:2532-4).
[0125]In certain embodiments, the invention provides methods that involve directly sequencing the product resulting from an MSP reaction to determine if the compound-converted HLTF template sequence contains CpG dinucleotides or UpG dinucleotides. Molecular biology techniques such as directly sequencing a PCR product are well known in the art.
[0126]In alternative embodiments, the skilled artisan will appreciate that the present invention is based in part, on the recognition that HLTF functions as a tumor suppressor gene. Accordingly, in certain aspects, the invention provides assays for detecting molecular markers that distinguish between healthy cells and HLTF-associated diseases cells, such as colon neoplasia cells. As described above, one of the molecular markers of the present application includes that methylated HLTF nucleotide sequences. Thus, in one embodiment, assaying for the methylation status of the HLTF nucleotide sequence can be monitored for detecting an HLTF-silencing associated disease.
[0127]This application further provides another molecular marker: the HLTF gene expression transcript or the gene product. Thus, in another embodiment, expression of the HLTF nucleic acid or protein can be monitored for detecting an HLTF-silencing associated disease such as a colon neoplasia.
[0128]In certain embodiments, the invention provides detection methods by assaying the above-mentioned HLTF molecular markers so as to determine whether a patient has or does not have a disease condition. Further, such a disease condition may be characterized by decreased expression of HLTF nucleic acid or protein described herein. In certain embodiments, the invention provides methods for determining whether a patient is or is not likely to have a HLTF-associated disease by detecting the expression of the HLTF nucleotide sequences. In further embodiments, the invention provides methods for determining whether the patient is having a relapse or determining whether a patient's cancer is responding to treatment.
[0129]In a preferred embodiment, the application provides method for detecting colon neoplasia. In certain embodiments, the present invention provides methods for detecting a colon neoplasia that is associated with silencing of HLTF gene. Such methods comprise assaying for the presence of a methylated HLTF nucleotide sequence in a sample obtained from a subject. In other aspects, the invention relates to methods for determining whether a patient is likely or unlikely to have a colon cancer. In further aspects, the invention relates to methods for monitoring colon neoplasia in a subject.
[0130]In certain embodiments, the invention provides assays for detecting HLTF protein or nucleic acid transcript described herein. In certain embodiments, a method of the invention comprises providing a biological sample and probing the biological sample for the HLTF expression which include protein or nucleic acid transcript of the HLTF. Information regarding the HLTF expression status, and optionally the quantitative level of the HLTF expression, may then be used to draw inferences about the nature of the biological sample and, if the biological sample was obtained from a subject, the health state of the subject.
[0131]In certain embodiments, methods of the present invention further comprise assaying for detecting a protein or a nucleic acid transcript selected from p16, THBS1, and hMLH1. The present inventors discovered that HLTF methylation correlates strongly with a pathway termed as the CpG island methylator phenotype (CIMP+), which may involve methylation of multiple genes, including p16, THBS1, and hMLH1. Methylation of these genes may lead to aberrant expression of the gene or the protein.
[0132]In certain embodiments, a method of the invention comprises detecting the presence of HLTF protein in a sample. Optionally, the method involves obtaining a quantitative measure of the HLTF protein in the sample. In view of this specification, one of skill in the art will recognize a wide range of techniques that may be employed to detect and optionally quantitate the presence of a protein. In preferred embodiments, HLTF protein is detected with an antibody. In many embodiments, an antibody-based detection assay involves bringing the sample and the antibody into contact so that the antibody has an opportunity to bind to proteins having the corresponding epitope. In many embodiments, an antibody-based detection assay also typically involves a system for detecting the presence of antibody-epitope complexes, thereby achieving a detection of the presence of the proteins having the corresponding epitope. Antibodies may be used in a variety of detection techniques, including enzyme-linked immunosorbent assays (ELISAs), immunoprecipitations, Western blots. Antibody-independent techniques for identifying a protein may also be employed. For example, mass spectroscopy, particularly coupled with liquid chromatography, permits detection and quantification of large numbers of proteins in a sample. Two-dimensional gel electrophoresis may also be used to identify proteins, and may be coupled with mass spectroscopy or other detection techniques, such as N-terminal protein sequencing. RNA aptamers with specific binding for the protein of interest may also be generated and used as a detection reagent.
[0133]Samples should generally be prepared in a manner that is consistent with the detection system to be employed. For example, a sample to be used in a protein detection system should generally be prepared in the absence of proteases. Likewise, a sample to be used in a nucleic acid detection system should generally be prepared in the absence of nucleases. In many instances, a sample for use in an antibody-based detection system will not be subjected to substantial preparatory steps. For example, urine may be used directly, as may saliva and blood, although blood will, in certain preferred embodiments, be separated into fractions such as plasma and serum.
[0134]In certain embodiments, a method of the invention comprises detecting the presence of an HLTF-expressed nucleic acid, such as an mRNA, in a sample. Optionally, the method involves obtaining a quantitative measure of the HLTF-expressed nucleic acid in the sample. In view of this specification, one of skill in the art will recognize a wide range of techniques that may be employed to detect and optionally quantitate the presence of a nucleic acid. Nucleic acid detection systems generally involve preparing a purified nucleic acid fraction of a sample, and subjecting the sample to a direct detection assay or an amplification process followed by a detection assay. Amplification may be achieved, for example, by polymerase chain reaction (PCR), reverse transcriptase (RT) and coupled RT-PCR. Detection of a nucleic acid is generally accomplished by probing the purified nucleic acid fraction with a probe that hybridizes to the nucleic acid of interest, and in many instances detection involves an amplification as well. Northern blots, dot blots, microarrays, quantitative PCR, and quantitative RT-PCR are all well known methods for detecting a nucleic acid in a sample.
[0135]In certain embodiments, the invention provides nucleic acid probes that bind specifically to an HLTF nucleic acid. Such probes may be labeled with, for example, a fluorescent moiety, a radionuclide, an enzyme or an affinity tag such as a biotin moiety. For example, the TaqMan® system employs nucleic acid probes that are labeled in such a way that the fluorescent signal is quenched when the probe is free in solution and bright when the probe is incorporated into a larger nucleic acid.
[0136]Immunoscintigraphy using monoclonal antibodies directed at the HLTF marker may be used to detect and/or diagnose a cancer. For example, monoclonal antibodies against the HLTF marker labeled with 99Technetium, 111Indium, 125Iodine--may be effectively used for such imaging. As will be evident to the skilled artisan, the amount of radioisotope to be administered is dependent upon the radioisotope. Those having ordinary skill in the art can readily formulate the amount of the imaging agent to be administered based upon the specific activity and energy of a given radionuclide used as the active moiety. Typically 0.1-100 millicuries per dose of imaging agent, preferably 1-10 millicuries, most often 2-5 millicuries are administered. Thus, compositions according to the present invention useful as imaging agents comprising a targeting moiety conjugated to a radioactive moiety comprise 0.1-100 millicuries, in some embodiments preferably 1-10 millicuries, in some embodiments preferably 2-5 millicuries, in some embodiments more preferably 1-5 millicuries.
[0137]In certain embodiments, the present invention provides drug screening assays for identifying test compounds which potentiate the tumor suppressor function of the HLTF gene. In one aspect, the assays detect test compounds which potentiate the expression level of the HLTF. In another aspect, the assays detect test compounds which inhibit the methylation of the HLTF nucleotide sequences. In certain embodiments, drug screening assays can be generated which detect test compounds on the basis of their ability to interfere with stability or function of the HLTF polypeptide. Alternatively, simple binding assays can be used to detect compounds that inhibit or potentiate the interaction between the HLTF polypeptide and its interacting protein (e.g., Sp1 or Sp3) or the binding of the HLTF polypeptide to a target DNA.
[0138]A variety of assay formats may be used and, in light of the present disclosure, those not expressly described herein will nevertheless considered to be within the purview of ordinary skill in the art. Assay formats can approximate such conditions as HLTF expression level, methylation status of HLTF sequence, tumor suppressing activity, transcriptional activating activity and may be generated in many different forms. In many embodiments, the invention provides assays including both cell-free systems and cell-based assays which utilize intact cells.
[0139]Compounds to be tested can be produced, for example, by bacteria, yeast or other organisms (e.g., natural products), produced chemically (e.g., small molecules, including peptidomimetics), or produced recombinantly. The efficacy of the compound can be assessed by generating dose response curves from data obtained using various concentrations of the test compound. Moreover, a control assay can also be performed to provide a baseline for comparison. In the control assay, the formation of complexes is quantitated in the absence of the test compound.
[0140]In many drug screening programs which test libraries of compounds and natural extracts, high throughput assays are desirable in order to maximize the number of compounds surveyed in a given period of time. Assays of the present invention which are performed in cell-free systems, such as may be developed with purified or semi-purified proteins or with lysates, are often preferred as "primary" screens in that they can be generated to permit rapid development and relatively easy detection of an alteration in a molecular target which is mediated by a test compound. Moreover, the effects of cellular toxicity and/or bioavailability of the test compound can be generally ignored in the in vitro system, the assay instead being focused primarily on the effect of the drug on the molecular target as may be manifest in an alteration of binding affinity with other proteins or changes in enzymatic properties of the molecular target.
[0141]In certain embodiments, test compounds identified from these assays may be used in a therapeutic method for treating an HLTF-associated proliferative disease.
[0142]Still another aspect of the application provides transgenic non-human animals which express a heterologous HLTF gene, or which have had one or more genomic HLTF gene(s) disrupted in at least one of the tissue or cell-types of the animal. For instance, transgenic mice that are disrupted at their HLTF gene locus can be generated.
[0143]In another aspect, the application provides an animal model for an HLTF-associated proliferative disease, which has a mis-expressed HLTF allele. For example, a mouse can be bred which has an HLTF allele deleted, or in which all or part of one or more HLTF exons are deleted. Such a mouse model can then be used to study disorders arising from mis-expression of the HLTF gene.
[0144]Accordingly, the present application discloses transgenic animals which are comprised of cells (of that animal) containing an HLTF transgene and which preferably (though optionally) express an exogenous HLTF protein in one or more cells in the animal. The HLTF transgene can encode the wild-type form of the protein, or can encode homologs thereof, including both agonists and antagonists, as well as antisense constructs. The HLTF transgene can include an HLTF nucleotide sequence (e.g., SEQ ID NOs: 2-4, 21 or 39) or fragments thereof. In preferred embodiments, the expression of the transgene is restricted to specific subsets of cells, tissues or developmental stages utilizing, for example, cis-acting sequences that control expression in the desired pattern.
[0145]Genetic techniques which allow for the expression of transgenes can be regulated via site-specific genetic manipulation in vivo are known to those skilled in the art. For instance, genetic systems are available which allow for the regulated expression of a recombinase that catalyzes the genetic recombination a target sequence. As used herein, the phrase "target sequence" refers to a nucleotide sequence that is genetically recombined by a recombinase. The target sequence is flanked by recombinase recognition sequences and is generally either excised or inverted in cells expressing recombinase activity. Recombinase catalyzed recombination events can be designed such that recombination of the target sequence results in either the activation or repression of expression of the HLTF polypeptides. For example, excision of a target sequence which interferes with the expression of a recombinant HLTF gene can be designed to activate expression of that gene. This interference with expression of the protein can result from a variety of mechanisms, such as spatial separation of the HLTF gene from the promoter element or an internal stop codon. Moreover, the transgene can be made wherein the coding sequence of the gene is flanked recombinase recognition sequences and is initially transfected into cells in a 3' to 5' orientation with respect to the promoter element. In such an instance, inversion of the target sequence will reorient the subject gene by placing the 5' end of the coding sequence in an orientation with respect to the promoter element which allow for promoter driven transcriptional activation.
[0146]In an illustrative embodiment, either the cre/loxP recombinase system of bacteriophage P1 (Lakso et al., (1992) Proc. Natl. Acad. Sci. USA 89:6232-6236; Orban et al., (1992) Proc. Natl. Acad. Sci. USA 89:6861-6865) or the FLP recombinase system of Saccharomyces cerevisiae (O'Gorman et al., (1991) Science 251:1351-1355; PCT publication WO 92/15694) can be used to generate in vivo site-specific genetic recombination systems. Cre recombinase catalyzes the site-specific recombination of an intervening target sequence located between loxP sequences. loxP sequences are 34 base pair nucleotide repeat sequences to which the Cre recombinase binds and are required for Cre recombinase mediated genetic recombination. The orientation of loxP sequences determines whether the intervening target sequence is excised or inverted when Cre recombinase is present (Abremski et al., (1984) J. Biol. Chem. 259:1509-1514); catalyzing the excision of the target sequence when the loxP sequences are oriented as direct repeats and catalyzes inversion of the target sequence when loxP sequences are oriented as inverted repeats.
V. Subjects and Samples
[0147]In certain aspects, the invention relates to a subject suspected of having or has an HLTF-associated disease such as colon neoplasia. Alternatively, a subject may be undergoing routine screening and may not necessarily be suspected of having such an HLTF-associated disease or condition. In a preferred embodiment, the subject is a human subject. and the HLTF associated disease is colon neoplasia.
[0148]Assaying for HLTF markers discussed above in a sample from subjects not known to have a colon neoplasia can aid in diagnosis of such a colon neoplasia in the subject. To illustrate, detecting the methylation status of the HLTF nucleotide sequence by MSP can be used by itself, or in combination with other various assays, to improve the sensitivity and/or specificity for detecting a colon neoplasia. Preferably, such a detection is made at an early stage in the development of cancer, so that treatment is more likely to be effective.
[0149]In addition to diagnosis, assaying of an HLTF marker in a sample from a subject not known to have colon neoplasia, can be prognostic for the subject (i.e., indicating the probable course of the disease). To illustrate, subjects having a predisposition to develop colon neoplasia may possess methylated HLTF nucleotide sequences. Assaying of HLTF markers in a samples from subjects can also be used to select a particular therapy or therapies which are particularly effective against the colon neoplasia in the subject, or to exclude therapies that are not likely to be effective.
[0150]Assaying of HLTF markers in samples from subjects that are known to have, or to have had, a cancer associated with silencing of the HLTF gene is also useful. For example, the present methods can be used to identify whether therapy is effective or not for certain subjects. One or more samples are taken from the same subject prior to and following therapy, and assayed for the HLTF markers. A finding that the HLTF marker is present in the sample taken prior to therapy and absent (or at a lower level) after therapy would indicate that the therapy is effective and need not be altered. In those cases where the HLTF marker is present in the sample taken before therapy and in the sample taken after therapy, it may be desirable to alter the therapy to increase the likelihood that the cancer will be eradicated in the subject. Thus, the present method may obviate the need to perform more invasive procedures which are used to determine a patient's response to therapy.
[0151]Cancers frequently recur following therapy in patients with advanced cancers. In this and other instances, the assays of the invention are useful for monitoring over time the status of an cancer associated with silencing of the HLTF gene. For subjects in which a cancer is progressing, an HLTF marker may be absent from some or all samples when the first sample is taken and then appear in one or more samples when the second sample is taken. For subjects in which cancer is regressing, an HLTF marker may be present in one or a number of samples when the first sample is taken and then be absent in some or all of these samples when the second sample is taken.
[0152]Samples for use with the methods described herein may be essentially any biological material of interest. For example, a sample may be a bodily fluid sample from a subject, a tissue sample from a subject, a solid or semi-solid sample from a subject, a primary cell culture or tissue culture of materials derived from a subject, cells from a cell line, or medium or other extracellular material from a cell or tissue culture, or a xenograft (meaning a sample of a cancer from a first subject, e.g., a human, that has been cultured in a second subject, e.g., an immuno-compromised mouse). The term "sample" as used herein is intended to encompass both a biological material obtained directly from a subject (which may be described as the primary sample) as well as any manipulated forms or portions of a primary sample. A sample may also be obtained by contacting a biological material with an exogenous liquid, resulting in the production of a lavage liquid containing some portion of the contacted biological material. Furthermore, the term "sample" is intended to encompass the primary sample after it has been mixed with one or more additive, such as preservatives, chelators, anti-clotting factors, etc.
[0153]In certain embodiments, a bodily fluid sample is a blood sample. In this case, the term "sample" is intended to encompass not only the blood as obtained directly from the patient but also fractions of the blood, such as plasma, serum, cell fractions (e.g., platelets, erythrocytes, and lymphocytes), protein preparations, nucleic acid preparations, etc. In certain embodiments, a bodily fluid sample is a urine sample or a colonic effluent sample. In certain embodiments, a bodily fluid sample is a stool sample.
[0154]A subject is preferably a human subject, but it is expected that the molecular markers disclosed herein, and particularly their homologs from other animals, are of similar utility in other animals. In certain embodiments, it may be possible to detect an HLTF marker directly in an organism without obtaining a separate portion of biological material. In such instances, the term "sample" is intended to encompass that portion of biological material that is contacted with a reagent or device involved in the detection process.
[0155]In certain embodiments, DNA which is used as the template in an MSP reaction is obtained from a bodily fluid sample. Examples of preferred bodily fluids are blood, serum, plasma, a blood-derived fraction, stool, colonic effluent or urine. Other body fluids can also be used. Because they can be easily obtained from a subject and can be used to screen for multiple diseases, blood or blood-derived fractions are especially useful. For example, it has been shown that DNA alterations in colorectal cancer patients can be detected in the blood of subjects (Hibi, et al., 1998, Cancer Res, 58:1405-7). Blood-derived fractions can comprise blood, serum, plasma, or other fractions. For example, a cellular fraction can be prepared as a "buffy coat" (i.e., leukocyte-enriched blood portion) by centrifuging 5 ml of whole blood for 10 min at 800 times gravity at room temperature. Red blood cells sediment most rapidly and are present as the bottom-most fraction in the centrifuge tube. The buffy coat is present as a thin creamy white colored layer on top of the red blood cells. The plasma portion of the blood forms a layer above the buffy coat. Fractions from blood can also be isolated in a variety of other ways. One method is by taking a fraction or fractions from a gradient used in centrifugation to enrich for a specific size or density of cells.
[0156]DNA is then isolated from samples from the bodily fluids. Procedures for isolation of DNA from such samples are well known to those skilled in the art. Commonly, such DNA isolation procedures comprise lysis of any cells present in the samples using detergents, for example. After cell lysis, proteins are commonly removed from the DNA using various proteases. RNA is removed using RNase. The DNA is then commonly extracted with phenol, precipitated in alcohol and dissolved in an aqueous solution.
VI. Therapeutic Methods for HLTF-Associated Diseases
[0157]Yet another aspect of this application pertains to methods of treating an HLTF-associated proliferative disease which arises from reduced expression or over-expression of the HLTF gene in cells. Such HLTF-associated proliferative diseases (for example, a colon neoplasia) can result from a wide variety of pathological cell proliferative conditions. In certain embodiments, treatment of an HLTF-associated proliferative disorder includes modulation of the HLTF gene expression or HLTF activity. The term "modulate" envisions the suppression of expression of HLTF when it is over-expressed, or augmentation of HLTF expression when it is under-expressed.
[0158]In an embodiment, the present invention provides a therapeutic method by using an HLTF gene construct as a part of a gene therapy protocol, such as to reconstitute the function of an HLTF protein (e.g., SEQ ID NO: 1) in a cell in which the HLTF protein is mis-expressed or non-expressed. To illustrate, cell types which exhibit pathological or abnormal growth presumably depend at least in part on a function of a HLTF protein. For example, gene therapy constructs encoding the HLTF protein can be utilized in a colon neoplasia that is associated with silencing of the HLTF gene.
[0159]In certain embodiments, the invention provides therapeutic methods using agents which induce re-expression of HLTF. Loss of HLTF gene expression in an HLTF-associated diseased cells may be due at least in part to methylation of the HLTF nucleotide sequence, methylation suppressive agents such as 5-deoxyazacytidine or 5-azacytidine can be introduced into the diseased cells. Other similar agents will be known to those of skill in the art. A preferred embodiments the HLTF-associated disease is colon neoplasia associated with increased methylation of HLTF nucleotide sequences.
[0160]In certain embodiments, the invention provides therapeutic methods using a nucleic acid approach, for example, antisense nucleic acid, ribozymes or triplex agents, to block transcription or translation of a specific HLTF mRNA, either by masking that mRNA with an antisense nucleic acid or triplex agent or by cleaving it with a ribozyme. Such disorders include neurodegenerative diseases, for example. Antisense nucleic acids are DNA or RNA molecules that are complementary to at least a portion of a specific mRNA molecule (Weintraub, Scientific American, 262:40, 1990). In the cell, the antisense nucleic acids hybridize to the corresponding mRNA, forming a double-stranded molecule. The antisense nucleic acids interfere with the translation of the mRNA, since the cell will not translate an mRNA that is double-stranded. Antisense oligomers of about 15 nucleotides are preferred, since they are easily synthesized and are less likely to cause problems than larger molecules when introduced into a target HLTF over-producing cell. Use of an oligonucleotide to stall transcription is known as the triplex strategy since the oligomer winds around double-helical DNA, forming a three-strand helix. Therefore, these triplex compounds can be designed to recognize a unique site on a chosen gene (Maher, et al., Antisense Res. and Dev., 1(3):227, 1991; Helene, C., Anticancer Drug Design, 6(6):569, 1991). Ribozymes are RNA molecules possessing the ability to specifically cleave other single-stranded RNA in a manner analogous to DNA restriction endonucleases. Through the modification of nucleotide sequences which encode these RNAs, it is possible to engineer molecules that recognize specific nucleotide sequences in an RNA molecule and cleave it (Cech, J. Amer. Med. Assn., 260:3030, 1988).
[0161]The present invention also provides gene therapy for the treatment of proliferative or immunologic disorders which are mediated by HLTF protein. Such therapy would achieve its therapeutic effect by introduction of the HLTF antisense polynucleotide into cells having the proliferative disorder. Alternatively, it may be desirable to introduce polynucleotides encoding full-length HLTF into diseased cells.
[0162]Delivery of antisense HLTF polynucleotide or the HLTF gene can be achieved using a recombinant expression vector such as a chimeric virus or a colloidal dispersion system. Especially preferred for therapeutic delivery of antisense sequences is the use of targeted liposomes. Various viral vectors which can be utilized for gene therapy as taught herein include adenovirus, herpes virus, vaccinia, or, preferably, an RNA virus such as a retrovirus. Preferably, the retroviral vector is a derivative of a murine or avian retrovirus. Examples of retroviral vectors in which a single foreign gene can be inserted include, but are not limited to: Moloney murine leukemia virus (MoMuLV), Harvey murine sarcoma virus (HaMuSV), murine mammary tumor virus (MuMTV), and Rous Sarcoma Virus (RSV). Preferably, when the subject is a human, a vector such as the gibbon ape leukemia virus (GaLV) is utilized. A number of additional retroviral vectors can incorporate multiple genes. All of these vectors can transfer or incorporate a gene for a selectable marker so that transduced cells can be identified and generated. By inserting an HLTF sequence of interest into the viral vector, along with another gene which encodes the ligand for a receptor on a specific target cell, for example, the vector is target-specific. Retroviral vectors can be made target-specific by attaching, for example, a sugar, a glycolipid or a protein. Preferred targeting is accomplished by using an antibody to target the retroviral vector. Those skilled in the art will know of, or can readily ascertain without undue experimentation, specific polynucleotide sequences which can be inserted into the retroviral genome or attached to a viral envelope to allow target-specific delivery of the retroviral vector containing antisense HLTF polynucleotide or the HLTF gene.
[0163]The invention also relates to a medicament or pharmaceutical composition comprising an HLTF 5' flanking polynucleotide or an HLTF 5' flanking polynucleotide operably linked to the HLTF structural gene, respectively, in a pharmaceutically acceptable excipient or medium wherein the medicament is used for therapy of HLTF-associated cell proliferative disorders, such as a colon neoplasia.
Exemplification
[0164]The invention now being generally described, it will be more readily understood by reference to the following examples, which are included merely for purposes of illustration of certain aspects and embodiments of the present invention, and are not intended to limit the invention.
1. Amplification and Sequencing of HLTF cDNA.
[0165]HLTF cDNA sequence was obtained from Genbank (accession No. Z46606) and primers were chosen by using MacVector software (Oxford Molecular). The HLTF cDNA was amplified in two overlapping fragments (5' fragment 286F/1381R and 3' fragment 1317F/3456R). Primer sequences were as follows: HLTF-286F (5'-GCTCCTCTTGTCATCCCACTCA), HLTF-1381R (5'-CGTCTTTGCTTAGTCCATCTGCCTT), HLTF-1317F (5''-CGATGGTCTATGAAACTTGGA), and HLTF-3456R (5'-GAAATTGTGTCAGTAATACCTCTTCAC). The HLTF 5' genomic sequence was identified from Genbank genomic clone sequence (NT--005616).
2. Methylation-Specific PCR (MS-PCR).
[0166]500 ng DNA from each sample in a volume of 50 ul were denatured by NaOH (freshly made, final concentration, 0.2 M) at 37° C. for 15 min. Next, 30 ul 10 mM hydroquinone (fresh) and 520 ul 3.0 M NaHSO4 (freshly prepared sodium bisulfite, pH5.0) were added, and incubated at 55° C. for 16 hrs. Modified DNA was purified using Wizard DNA Clean-Up System (Promega). The reaction was desulphonated by NaOH at a final concentration of 0.3 M at room temperature for 15 min and neutralized by adding 10 M NH4OAc, pH7.0, to a final concentration of 3 M. DNA was precipitated with 3 volumes of absolute ethanol for 30 min at -80° C. The DNA pellet was then dissolved in distilled water to give approximately 10 ng/ul. Sodium bisulfite treated DNA was used as the template for subsequent methylation-specific PCR.
[0167]The primer sequences were based on the HLTF 5' genomic sequence and were specific for fully modified DNA. Primer set for the methylated DNA are P-HLTF1352MF: 5'-TGGGGTTTCGTGGTTTTTTCGCGC-3' and P-HLTF1606MR: 5'-CCGCGAATCCAATCAAACGTCGACG-3', which gives 254 bp product. The primer set for the unmethylated DNA are P-HLTF1347UF: 5'-ATTTTTGGGGTTTTGTGGTTTTTTTGTGT-3' and P-HLTF1610UR: ATCACCACAAATCCAATCAAACATCAACA-3', which amplify 284 bp fragment. PCR was carried out using a hot start at 95° C. (9 minutes) and the following cycling parameters: 33 cycles of 95° C. (45 s), 66° C. (45 s), 72° C. (45 s), 72° C. (5 minutes), and 4° C. to cool. The PCR products were run on 3.0% agarose gel.
3. Cell Culture and 5-Azacytidine Treatment.
[0168]The cultures were grown and treated as described previously (Veigl, et al., 1998, Proc. Natl. Acad. Sci. USA, 95:8698-8702). The optimal tolerated doses were determined for each treated line, and two doses were used for some lines, ranging from 1 μg/ml to 3 μg/ml.
4. Clonogenic Assays and Transfections.
[0169]Cells were plated in 6-well dishes (12,000-20,00 cells/well) 24 hours before transfection in a regular growing medium and transfected with 0.4 μg DNA/well with effectene (QIAGEN) according to manufacturers protocol. G418 (0.5 mg/ml for FET, 0.6 mg/ml for V457) was added to the wells 48 hours after transfection, and cell were kept in G418 media (replaced bi-weekly) for 3 weeks, until tight colonies were observed. Colonies were stained with trypan blue and counted.
5. Statistical Methods.
[0170]Comparisons of HLTF methylation with sex, MLH1 methylation status, and CIMP (i.e., CpG island methylator phenotype) status were done using a two tailed Fisher's exact test. Comparison of HLTF methylation status with tumor site or stage was done using a Pearson's chi-squared test, with test for trend using a Mantel chi-squared test. Comparison of age distribution of smarca3 methylation in cancers and in normal tissue was done using a Wilcoxin non-parametric test.
6. HLTF is not Mutated in Colon Cancers.
[0171]As several SWI/SNF family genes have been found to be altered in human cancers, we first determined the sequence of the HLTF cDNA amplified by RT-PCR from 34 colon cancer cell lines matched to primary patient samples in our colon cancer bank. Only one mutation was detected, a hemizygous nonsense mutation at codon 979. Thus HLTF is not a common target for gene mutation in colon cancer.
7. HLTF is Frequently Methylated and Silenced in Colon Cancer Cell Lines.
[0172]In the process of HLTF sequence analysis in colon cancer cell lines we noted that 9 out of 34 of these cell lines did not express HLTF cDNA (FIG. 1A). Southern analysis did not identify any alterations in the HLTF locus. Coincidentally, in some of the cell lines that had lost HLTF expression we previously had demonstrated silencing of the hMLH1 gene due to promoter methylation (Veigl, et al., 1998, Proc. Natl. Acad. Sci. USA, 95:8698-8702). We therefore examined the genomic sequence upstream of and within the HLTF gene (herein referred to as 5'-HLTF genomic sequence) which contained a CpG dense region that could potentially be methylated (FIG. 2A). No TATA box consensus sequence was found within this region of 5' HLTF genomic sequence. However, it did contain a consensus initiator element, and two SP1 sites that are typical of TATA-less housekeeping gene promoters.
[0173]To test for methylation of this CpG-rich region, we used the technique of methylation specific PCR (Herman, et al., 1996, Proc. Natl. Acad. Sci. USA, 93:9821-9826), employing PCR primers specific for amplification of either methylated or unmethylated DNA templates (FIG. 2A). As shown in FIG. 2B, all colon cancer cell lines that lacked HLTF gene expression demonstrated methylation of CpG sites within the 5' HLTF genomic sequence; whereas, methylation was not detected in the HLTF expressing cell lines. These results were confirmed by two independent MS-PCR assays that tested different HLTF CpG sites, as well as by resistance of the 5' HLTF genomic sequence to digestion with a methylation-sensitive restriction enzyme, HpaII enzyme. Thus, cell lines that had silenced the HLTF gene demonstrated methylation across the CpG sites within this entire region, whereas HLTF-expressing cell lines assayed as free of methylated CpG sites.
[0174]For three of these HLTF methylated cell lines, DNA from matched normal and antecedant tumor DNA was additionally available (V6, V8, and V432). In each of these cases, HLTF DNA methylation was detected in the primary tumors, but was absent in the matched normal tissues (FIG. 2C), verifying that HLTF methylation and silencing was a true somatic event and was not an artifact of cell line cultures.
8. Re-Induction of HLTF Expression.
[0175]To establish that methylation was responsible for silencing HLTF gene expression, cell lines with HLTF DNA methylation were treated with 5-azacytidine (5-azaC), a demethylating agent. As shown in FIG. 1B, 5-azaC treatment reactivated HLTF expression in all these cell lines, though Vaco457 required higher dose of 5-azaC for reactivation, compared to other cell lines. However, 5-AzaC did not further increase HLTF expression in control cell lines in which HLTF expression was constitutive and in which the basal HLTF DNA was unmethylated.
9. HLTF Methylation is Widespread in Primary Colon Cancer.
[0176]To further establish the frequency of HLTF methylation in primary colon cancer tumors, we analyzed 63 pairs of primary colon tumors along with matched normal tissues (FIG. 3). HLTF methylation was detected in 27 of 63 (45%) colon cancer cases. In contrast, no evidence of HLTF gene silencing was detected in 30 lung tumor cell lines, or 8 breast cancer samples tested.
[0177]The finding of HLTF methylation in colon cancer tumors and cell lines was not correlated with patients' sex (p==0.31) or with age (p=0.14) (FIG. 4A), with a median age of 72 in persons with HLTF methylated cancers versus 68 in those with HLTF unmethylated cancers. In the overwhelming majority of cases (84%), HLTF methylation was detected only in the colon cancers, and was absent from the same individuals' normal colon tissues. HLTF methylation thus substantially arose in these individuals specifically during the neoplastic process. However, in 16% of individuals whose colon cancers demonstrated HLTF methylation, very faint HLTF methylation was also detectable in histologically normal colon tissue.
[0178]This was also the case in a small subset of all individuals from whom normal colon tissues were available to us, among whom faint HLTF methylation was detectable in 9 out of 78 normal colon samples (12%). Normal colon tissues showing faint HLTF methylation were in general those derived from the oldest individuals studied, with a median age of 81 for individuals demonstrating faint HLTF methylation in normal colon tissues versus 67 for those with whose normal colon tissue showed only unmethylated HLTF (p=0.02) (FIG. 4B). We cannot determine whether HLTF methylated cells when detected in normal colon tissues are derived from contamination by cancer cells derived from a frank cancer that was concurrently resected in the same colon specimen from which our normal sample was taken, are derived from microscopic early colon neoplasms, or alternatively whether in some instances HLTF methylation can be initiated in the aging colon separately from neoplasia, either as a stochastic event, or in response to endogenous or exogenous genotoxins. However, we favor the explanation that these signals arise from contaminating cancer cells, as no HLTF methylation was detected in colons from any of 12 individuals undergoing colon resection for non-malignant disease. Compared to HLTF expressing cancers, cancers with methylated and silenced HLTF alleles showed a borderline significant trend (p=0.06) to be more likely to arise in the proximal right colon and less likely to arise in the left colon or rectum, a trend similar to that previously observed for HNPCC and sporadic MSI colon cancers (Kinzler, et al., 1996, Cell, 87:159-170) (FIG. 5A vs. 5B). The distribution by tumor stage (adenoma; Dukes' stage B, C or D cancer primary; or metastatic lesion) was also significantly different between HLTF methylated and non-methylated colon neoplasms (p=0.02). An a postiori grouping of the tumors into a non-metastatic subset and a metastatic subset (Dukes D primary cancers or cancers from distant metastatic sites) suggests the hypothesis that this is due to a lesser likelihood of HLTF methylated tumors being metastastic (nominal p value=0.01) (FIG. 5C vs. 5D).
[0179]To determine the timing of onset of HLTF silencing during colon carcinogenesis, we additionally analyzed a group of 14 early and late adenomas for HLTF CpG island methylation. HLTF methylation was detected in 3 of the adenomas tested, all of which were greater than 1.5 cm in size, suggesting that HLTF methylation can occur as early as the late adenoma stage of colon neoplasia. Detection of HLTF methylation may thus be of value for detecting the early and most curable stages of colon neoplasia.
10. HLTF Methylation Defines a Singular Group of Colon Cancers.
[0180]Recently, it has been suggested that certain colon cancers are typified by a high frequency of gene promoter methylation and represent a distinct pathway termed the CpG island methylator phenotype (CIMP+) (Toyota, et al., 1999, Proc. Natl. Acad. Sci. USA, 96:8681-8686; Toyota, et al., 2000, Proc. Natl. Acad. Sci. USA, 97:710-715). Tumors exhibiting this phenotype (CIMP+) show concordant CpG island methylation affecting multiple genes, including hMLH1, p16, and THBS1. To establish whether HLTF methylation correlates with hMLH1 methylation, and/or with the CIMP+ phenotype, 87 colon cancer cases examined for HLTF methylation were also examined for hMLH1 methylation, and 64 were further assayed for CIMP+ or CIMP-phenotype as determined by methylation status of MINT1, MINT2, MINT31, and MINT27 locii (Toyota, et al., 1999, Proc. Natl. Acad. Sci. USA, 96:8681-8686). HLTF methylation correlated with CIMP+ phenotype (p<0.001) (FIG. 6A) and as well with hMLH1 gene methylation (p<0.0001) (FIG. 6B). However, HLTF-methylated tumors essentially defined a distinct subclass of colon cancers that did not fall exclusively into either the hMLH1 methylated or CIMP+ groups.
[0181]To further determine whether HLTF methylation defines a singular group of colon cancers, we used restriction landmark genomic scanning (RLGS) analysis (Costello, et al., 2000, Nat. Genet., 24:132-138) to compare the patterns of global genome methylation in a group of twelve colon cancer cell lines, six of which demonstrated HLTF methylation and silencing and in six of which HLTF was unmethylated and expressed. 497 loci demonstrated methylation present in at least one of the 12 colon cancer cell lines.
[0182]However, none of these loci demonstrated the presence of methylation across the six colon cancers in which HLTF was methylated and silenced, as well as the absence of methylation across the 6 colon cancers expressing an unmethylated HLTF allele. While RLGS analysis samples only a portion of the genome, this data independently suggests that HLTF methylation is a unique event, and does not necessarily reflect a genome-wide increase in promoter methylation.
11. HLTF Reconstitution Induces Growth Suppression.
[0183]The high frequency of HLTF methylation observed in colon cancer suggested that inactivation of this gene might confer a selective advantage. To assay for such an advantage we examined the effect of HLTF transfection on colony formation in three HLTF methylated and non-expressing colon cancer cell lines (V457, V8-2, RCA) as compared to three HLTF unmethylated and expressing colon cancer cell lines (FET, V364, V429). Reconstitution of HLTF expression in HLTF methylated cells suppressed colony forming ability by 75% in each of the three lines tested (p<0.0001 for each) (FIG. 7A). In contrast, transfection of HLTF did not show significant colony suppression in the any of the three cell lines that already expressed endogenous HLTF (FIG. 7B). Growth suppression by exogenous HLTF was thus specific to colon cancers that had silenced the endogenous alleles (p<0.01 for the difference in effect of HLTF transfection in HLTF methylated versus unmethylated cell lines). Transient transfections showed both the HLTF methylated and unmethylated cells were well able to express exogenous HLTF, as determine by Western analysis for a V5 epitope tag attached to HLTF in the expression vector (FIG. 7C). These findings suggest that HLTF methylation and silencing indeed confers a growth advantage in a distinct subclass of colon cancers.
[0184]In sum, certainly future studies can be expected to further elucidate the presumptive pathogenetic role that we suggest for HLTF inactivation in colon cancer. Moreover, the high frequency of HLTF methylation in colon cancer may also be useful in potential translational applications. We and others have shown that methylated promoter DNA can be detected in the blood of some cancer patients (Grady, et al., 2001, Cancer Res., 61:900-902). Thus, it will also be attractive to explore the possibility that assays for methylation of HLTF in body fluids may be of future value for early detection of colon cancer incidence, relapse or prognosis.
INCORPORATION BY REFERENCE
[0185]All publications and patents mentioned herein are hereby incorporated by reference in their entirety as if each individual publication or patent was specifically and individually indicated to be incorporated by reference. In case of conflict, the present application, including any definitions herein, will control.
EQUIVALENTS
[0186]While specific embodiments of the subject invention have been discussed, the above specification is illustrative and not restrictive. Many variations of the invention will become apparent to those skilled in the art upon review of this specification and the claims below. The full scope of the invention should be determined by reference to the claims, along with their full scope of equivalents, and the specification, along with such variations.
Sequence CWU
1
5111009PRTHomo sapiens 1Met Ser Trp Met Phe Lys Arg Asp Pro Val Trp Lys
Tyr Leu Gln Thr1 5 10
15Val Gln Tyr Gly Val His Gly Asn Phe Pro Arg Leu Ser Tyr Pro Thr
20 25 30Phe Phe Pro Arg Phe Glu Phe
Gln Asp Val Ile Pro Pro Asp Asp Phe 35 40
45Leu Thr Ser Asp Glu Glu Val Asp Ser Val Leu Phe Gly Ser Leu
Arg 50 55 60Gly His Val Val Gly Leu
Arg Tyr Tyr Thr Gly Val Val Asn Asn Asn65 70
75 80Glu Met Val Ala Leu Gln Arg Asp Pro Asn Asn
Pro Tyr Asp Lys Asn 85 90
95Ala Ile Lys Val Asn Asn Val Asn Gly Asn Gln Val Gly His Leu Lys
100 105 110Lys Glu Leu Ala Gly Ala
Leu Ala Tyr Ile Met Asp Asn Lys Leu Ala 115 120
125Gln Ile Glu Gly Val Val Pro Phe Gly Ala Asn Asn Ala Phe
Thr Met 130 135 140Pro Leu His Met Thr
Phe Trp Gly Lys Glu Glu Asn Arg Lys Ala Val145 150
155 160Ser Asp Gln Leu Lys Lys His Gly Phe Lys
Leu Gly Pro Ala Pro Lys 165 170
175Thr Leu Gly Phe Asn Leu Glu Ser Gly Trp Gly Ser Gly Arg Ala Gly
180 185 190Pro Ser Tyr Ser Met
Pro Val His Ala Ala Val Gln Met Thr Thr Glu 195
200 205Gln Leu Lys Thr Glu Phe Asp Lys Leu Phe Glu Asp
Leu Lys Glu Asp 210 215 220Asp Lys Thr
His Glu Met Glu Pro Ala Glu Ala Ile Glu Thr Pro Leu225
230 235 240Leu Pro His Gln Lys Gln Ala
Leu Ala Trp Met Val Ser Arg Glu Asn 245
250 255Ser Lys Glu Leu Pro Pro Phe Trp Glu Gln Arg Asn
Asp Leu Tyr Tyr 260 265 270Asn
Thr Ile Thr Asn Phe Ser Glu Lys Asp Arg Pro Glu Asn Val His 275
280 285Gly Gly Ile Leu Ala Asp Asp Met Gly
Leu Gly Lys Thr Leu Thr Ala 290 295
300Ile Ala Val Ile Leu Thr Asn Phe His Asp Gly Arg Pro Leu Pro Ile305
310 315 320Glu Arg Val Lys
Lys Asn Leu Leu Lys Lys Glu Tyr Asn Val Asn Asp 325
330 335Asp Ser Met Lys Leu Gly Gly Asn Asn Thr
Ser Glu Lys Ala Asp Gly 340 345
350Leu Ser Lys Asp Ala Ser Arg Cys Ser Glu Gln Pro Ser Ile Ser Asp
355 360 365Ile Lys Glu Lys Ser Lys Phe
Arg Met Ser Glu Leu Ser Ser Ser Arg 370 375
380Pro Lys Arg Arg Lys Thr Ala Val Gln Tyr Ile Glu Ser Ser Asp
Ser385 390 395 400Glu Glu
Ile Glu Thr Ser Glu Leu Pro Gln Lys Met Lys Gly Lys Leu
405 410 415Lys Asn Val Gln Ser Glu Thr
Lys Gly Arg Ala Lys Ala Gly Ser Ser 420 425
430Lys Val Ile Glu Asp Val Ala Phe Ala Cys Ala Leu Thr Ser
Ser Val 435 440 445Pro Thr Thr Lys
Lys Lys Met Leu Lys Lys Gly Ala Cys Ala Val Glu 450
455 460Gly Ser Lys Lys Thr Asp Val Glu Glu Arg Pro Arg
Thr Thr Leu Ile465 470 475
480Ile Cys Pro Leu Ser Val Leu Ser Asn Trp Ile Asp Gln Phe Gly Gln
485 490 495His Ile Lys Ser Asp
Val His Leu Asn Phe Tyr Val Tyr Tyr Gly Pro 500
505 510Asp Arg Ile Arg Glu Pro Ala Leu Leu Ser Lys Gln
Asp Ile Val Leu 515 520 525Thr Thr
Tyr Asn Ile Leu Thr His Asp Tyr Gly Thr Lys Gly Asp Ser 530
535 540Pro Leu His Ser Ile Arg Trp Leu Arg Val Ile
Leu Asp Glu Gly His545 550 555
560Ala Ile Arg Asn Pro Asn Ala Gln Gln Thr Lys Ala Val Leu Asp Leu
565 570 575Glu Ser Glu Arg
Arg Trp Val Leu Thr Gly Thr Pro Ile Gln Asn Ser 580
585 590Leu Lys Asp Leu Trp Ser Leu Leu Ser Phe Leu
Lys Leu Lys Pro Phe 595 600 605Ile
Asp Arg Glu Trp Trp His Arg Thr Ile Gln Arg Pro Val Thr Met 610
615 620Gly Asp Glu Gly Gly Leu Arg Arg Leu Gln
Ser Leu Ile Lys Asn Ile625 630 635
640Thr Leu Arg Arg Thr Lys Thr Ser Lys Ile Lys Gly Lys Pro Val
Leu 645 650 655Glu Leu Pro
Glu Arg Lys Val Phe Ile Gln His Ile Thr Leu Ser Asp 660
665 670Glu Glu Arg Lys Ile Tyr Gln Ser Val Lys
Asn Glu Gly Arg Ala Thr 675 680
685Ile Gly Arg Tyr Phe Asn Glu Gly Thr Val Leu Ala His Tyr Ala Asp 690
695 700Val Leu Gly Leu Leu Leu Arg Leu
Arg Gln Ile Cys Cys His Thr Tyr705 710
715 720Leu Leu Thr Asn Ala Val Ser Ser Asn Gly Pro Ser
Gly Asn Asp Thr 725 730
735Pro Glu Glu Leu Arg Lys Lys Leu Ile Arg Lys Met Lys Leu Ile Leu
740 745 750Ser Ser Gly Ser Asp Glu
Glu Cys Ala Ile Cys Leu Asp Ser Leu Thr 755 760
765Val Pro Val Ile Thr His Cys Ala His Val Phe Cys Lys Pro
Cys Ile 770 775 780Cys Gln Val Ile Gln
Asn Glu Gln Pro His Ala Lys Cys Pro Leu Cys785 790
795 800Arg Asn Asp Ile His Glu Asp Asn Leu Leu
Glu Cys Pro Pro Glu Glu 805 810
815Leu Ala Arg Asp Ser Glu Lys Lys Ser Asp Met Glu Trp Thr Ser Ser
820 825 830Ser Lys Ile Asn Ala
Leu Met His Ala Leu Thr Asp Leu Arg Lys Lys 835
840 845Asn Pro Asn Ile Lys Ser Leu Val Val Ser Gln Phe
Thr Thr Phe Leu 850 855 860Ser Leu Ile
Glu Ile Pro Leu Lys Ala Ser Gly Phe Val Phe Thr Arg865
870 875 880Leu Asp Gly Ser Met Ala Gln
Lys Lys Arg Val Glu Ser Ile Gln Cys 885
890 895Phe Gln Asn Thr Glu Ala Gly Ser Pro Thr Ile Met
Leu Leu Ser Leu 900 905 910Lys
Ala Gly Gly Val Gly Leu Asn Leu Ser Ala Ala Ser Arg Val Phe 915
920 925Leu Met Asp Pro Ala Trp Asn Pro Ala
Ala Glu Asp Gln Cys Phe Asp 930 935
940Arg Cys His Arg Leu Gly Gln Lys Gln Glu Val Ile Ile Thr Lys Phe945
950 955 960Ile Val Lys Asp
Ser Val Glu Glu Asn Met Leu Lys Ile Gln Asn Lys 965
970 975Lys Arg Glu Leu Ala Ala Gly Ala Phe Gly
Thr Lys Lys Pro Asn Ala 980 985
990Asp Glu Met Lys Gln Ala Lys Ile Asn Glu Ile Arg Thr Leu Ile Asp
995 1000 1005Leu23000DNAHomo sapiens
2attgccattt gcctcaacac acttcaagaa gaacatacaa caattaaaat aatttgaaat
60gccattgtct ttatcttcct catcaacttt ggtcaaatga aataatacat tgtttcaaat
120tctattgctc atccatggtt catgattaga gatcactaca gtttatttac gataatgaac
180aaatgtgtct tgaaaagctg actaccggcc tggcacggtg gctcatgcct gtaatcccag
240cactctggga ggccgaggcg ggcggatcac ctgaggtcag gagttcaaga ccagcctggc
300caatatggtg aaaccccgtc tctactaaaa atacaaaaat tagctgggcg tggtggcggg
360ctcctgcaat cccggctact ccggaggctg aggcaggaga atcgcttgaa tccgggaggc
420ggaggttgca gtgagccaag atcaccccca ttgcactcca gcctggggga cagagcaaga
480ctccatctca ataaaaaaga aaaaaaaaca aaaaaaagaa aaaagaaaag ctgaccacct
540aatattattg tgctgacagg cgtgtaacat ggacctctct gtgacgcgca agtattatct
600ggcagttaca tggtgccgcc tgtgcagctg ggcgccacca ccagcttcct gccctgtgcg
660attaaaaggc cagcttcggg gacagcagcc tcctcaacgc aaagcgacca gccctgccag
720gcgtcggctg cagctggtcc aaggttgacg gtcccagggc cgcattctcc ttccggtcac
780ctcagactct ggaaagagag ccatggaggg cctcagagcg cccctctgca gttccttgag
840gaaaacaagg gcctccccct cttggctgga gcagccaagt gcccacgggg aaaatggtcc
900cgtccaccat ttcaggcaaa cgcccccata gctgtaataa acatgtgatg ggtttcctgc
960accaagaaag gaaaattttc cttaaaaata acaaaaagag gtggggaggg gactctaagt
1020tccatttaag attttccaaa ctcgttgtgt ctcatgcata cgctgaggct taacatgaaa
1080atttggagcg aaagtcccca cggttcacca cagggaatga agtaaagtat ctcgccggta
1140gctctaggtc tcgtaaaccg tggagctcct ttagtataag gccctattat gtgccacatt
1200agatgagaag aaaataagag gaaggatacg gtagagcctt tagaatgggg ttcagtctta
1260ggactcgtag gataaaggaa ggtcgtttcc ctccgtttga ggcagggaat caaacaaaac
1320accggcaccg caggcaccgc agtcgcactc ctggggcctc gtggctttcc cgcgcgcccg
1380ccttgggggc ggggaagaac ccggatggaa ccacccttgc agcccggacc ccccgccgtc
1440tttgaatggc agcggggcgg agttgccctc ccttctgtgc tctgactggt ttggctccgc
1500actctcctct tcgtgattgg gctttcctag tgccagtcac agagcgacgc tggtctccca
1560gattgttgca gaaggagacg gcgtcgacgt ctgactggac tcgcggcgac ttacctttca
1620gtcgtgcgct cctgatccgg cgctcggaat ttgtccccgg cttcagggct gcggggcctg
1680gaaggaggcg tatcgaggcg gctcgaaaac gatccagggg agccgaggcg ctcctcttgt
1740catcccactc agcgccatgt cctggatgtt caagaggtga agggggcgga gggggtgggg
1800cgctcggtct aacggcctgg aggcgtcccc aaatgaacct gaccttcccg gcgttcctct
1860gcgttccctg ggcgatttgt gcagctgtat tcgttctgtt ggtcgcatat gtggccgccg
1920gagaaataaa tgcattgtct tcgctggcga gtaggggctc ctagggcgag tcccgtgtta
1980gggacttggg aaatctctgt caccgactgt ggggggcctg gggcttaaag catctcggcc
2040agtgctttat ttaacaaaac atccattaga tacccactgt gcgccaggca ttgtgctcgg
2100tggagggccg agtaagacag tccttgtccc caagtggctc acagcttatc tgtatgcaca
2160gacgtgcaga taagtatttg cagaggcgct agtagagata tgcgccactt gcaaggaaaa
2220cgcatcgcaa ataagcgact gttcttagga gaatggaaac tttttgcaat gctccagtag
2280actccccttc acttttcatt gtccaaattg gtatttatgg tcattcccga aacagttaca
2340cctaatgggt ttggggttat gcttaaaccg gcgacggaag gcgggtacgg gtggcttggg
2400gccaatttcc cacggtggcg tggtttggat accagaatag gtgttctgct cgcagggagt
2460tacagtgatg gatgctggat gtattttctg gaaaattata aacaacttct ggtacattaa
2520tagttgtatt tttataagtc ttttatatag ctcaatccct attgaaattt tggcatatga
2580tttcattttt taaattattt ttttgacatg gagtctcgct ctgtcgtcca ggctggagtg
2640cagtggcaca atctcggctc gctgcaacct acgcctcccg ggttcaagtg attctcctgc
2700ctcagcctcc cgagtagctg ggattacata tgggcgccac cacgcctggc taatttttgt
2760atttgtagta gacagggttt cacaatgttg gccaggttgg tctcaaattc ctgacctcag
2820gtgatccacc tgcctctgcc tcccaaagtg ctgggattcc aggcgtgagc caccgcatcc
2880ggcccataat tgtagttaca gacatcatat acatatcaat ttagatgatt ttcactggga
2940aagaggttag tgaactctga ctaggctttt cattttgaat acctaggggg caggtagaag
300032001DNAHomo sapiens 3tggcagttac atggtgccgc ctgtgcagct gggcgccacc
accagcttcc tgccctgtgc 60gattaaaagg ccagcttcgg ggacagcagc ctcctcaacg
caaagcgacc agccctgcca 120ggcgtcggct gcagctggtc caaggttgac ggtcccaggg
ccgcattctc cttccggtca 180cctcagactc tggaaagaga gccatggagg gcctcagagc
gcccctctgc agttccttga 240ggaaaacaag ggcctccccc tcttggctgg agcagccaag
tgcccacggg gaaaatggtc 300ccgtccacca tttcaggcaa acgcccccat agctgtaata
aacatgtgat gggtttcctg 360caccaagaaa ggaaaatttt ccttaaaaat aacaaaaaga
ggtggggagg ggactctaag 420ttccatttaa gattttccaa actcgttgtg tctcatgcat
acgctgaggc ttaacatgaa 480aatttggagc gaaagtcccc acggttcacc acagggaatg
aagtaaagta tctcgccggt 540agctctaggt ctcgtaaacc gtggagctcc tttagtataa
ggccctatta tgtgccacat 600tagatgagaa gaaaataaga ggaaggatac ggtagagcct
ttagaatggg gttcagtctt 660aggactcgta ggataaagga aggtcgtttc cctccgtttg
aggcagggaa tcaaacaaaa 720caccggcacc gcaggcaccg cagtcgcact cctggggcct
cgtggctttc ccgcgcgccc 780gccttggggg cggggaagaa cccggatgga accacccttg
cagcccggac cccccgccgt 840ctttgaatgg cagcggggcg gagttgccct cccttctgtg
ctctgactgg tttggctccg 900cactctcctc ttcgtgattg ggctttccta gtgccagtca
cagagcgacg ctggtctccc 960agattgttgc agaaggagac ggcgtcgacg tctgactgga
ctcgcggcga cttacctttc 1020agtcgtgcgc tcctgatccg gcgctcggaa tttgtccccg
gcttcagggc tgcggggcct 1080ggaaggaggc gtatcgaggc ggctcgaaaa cgatccaggg
gagccgaggc gctcctcttg 1140tcatcccact cagcgccatg tcctggatgt tcaagaggtg
aagggggcgg agggggtggg 1200gcgctcggtc taacggcctg gaggcgtccc caaatgaacc
tgaccttccc ggcgttcctc 1260tgcgttccct gggcgatttg tgcagctgta ttcgttctgt
tggtcgcata tgtggccgcc 1320ggagaaataa atgcattgtc ttcgctggcg agtaggggct
cctagggcga gtcccgtgtt 1380agggacttgg gaaatctctg tcaccgactg tggggggcct
ggggcttaaa gcatctcggc 1440cagtgcttta tttaacaaaa catccattag atacccactg
tgcgccaggc attgtgctcg 1500gtggagggcc gagtaagaca gtccttgtcc ccaagtggct
cacagcttat ctgtatgcac 1560agacgtgcag ataagtattt gcagaggcgc tagtagagat
atgcgccact tgcaaggaaa 1620acgcatcgca aataagcgac tgttcttagg agaatggaaa
ctttttgcaa tgctccagta 1680gactcccctt cacttttcat tgtccaaatt ggtatttatg
gtcattcccg aaacagttac 1740acctaatggg tttggggtta tgcttaaacc ggcgacggaa
ggcgggtacg ggtggcttgg 1800ggccaatttc ccacggtggc gtggtttgga taccagaata
ggtgttctgc tcgcagggag 1860ttacagtgat ggatgctgga tgtattttct ggaaaattat
aaacaacttc tggtacatta 1920atagttgtat ttttataagt cttttatata gctcaatccc
tattgaaatt ttggcatatg 1980atttcatttt ttaaattatt t
20014551DNAHomo sapiens 4gttcagtctt aggactcgta
ggataaagga aggtcgtttc cctccgtttg aggcagggaa 60tcaaacaaaa caccggcacc
gcaggcaccg cagtcgcact cctggggcct cgtggctttc 120ccgcgcgccc gccttggggg
cggggaagaa cccggatgga accacccttg cagcccggac 180cccccgccgt ctttgaatgg
cagcggggcg gagttgccct cccttctgtg ctctgactgg 240tttggctccg cactctcctc
ttcgtgattg ggctttccta gtgccagtca cagagcgacg 300ctggtctccc agattgttgc
agaaggagac ggcgtcgacg tctgactgga ctcgcggcga 360cttacctttc agtcgtgcgc
tcctgatccg gcgctcggaa tttgtccccg gcttcagggc 420tgcggggcct ggaaggaggc
gtatcgaggc ggctcgaaaa cgatccaggg gagccgaggc 480gctcctcttg tcatcccact
cagcgccatg tcctggatgt tcaagaggtg aagggggcgg 540agggggtggg g
5515551DNAHomo sapiens
5gtttagtttt aggattcgta ggataaagga aggtcgtttt ttttcgtttg aggtagggaa
60ttaaataaaa tatcggtatc gtaggtatcg tagtcgtatt tttggggttt cgtggttttt
120tcgcgcgttc gttttggggg cggggaagaa ttcggatgga attatttttg tagttcggat
180ttttcgtcgt ttttgaatgg tagcggggcg gagttgtttt tttttttgtg ttttgattgg
240tttggtttcg tatttttttt ttcgtgattg ggttttttta gtgttagtta tagagcgacg
300ttggtttttt agattgttgt agaaggagac ggcgtcgacg tttgattgga ttcgcggcga
360tttatttttt agtcgtgcgt ttttgattcg gcgttcggaa tttgttttcg gttttagggt
420tgcggggttt ggaaggaggc gtatcgaggc ggttcgaaaa cgatttaggg gagtcgaggc
480gttttttttg ttattttatt tagcgttatg ttttggatgt ttaagaggtg aagggggcgg
540agggggtggg g
5516551DNAHomo sapiens 6gtttagtttt aggatttgta ggataaagga aggttgtttt
tttttgtttg aggtagggaa 60ttaaataaaa tattggtatt gtaggtattg tagttgtatt
tttggggttt tgtggttttt 120ttgtgtgttt gttttggggg tggggaagaa tttggatgga
attatttttg tagtttggat 180tttttgttgt ttttgaatgg tagtggggtg gagttgtttt
tttttttgtg ttttgattgg 240tttggttttg tatttttttt tttgtgattg ggttttttta
gtgttagtta tagagtgatg 300ttggtttttt agattgttgt agaaggagat ggtgttgatg
tttgattgga tttgtggtga 360tttatttttt agttgtgtgt ttttgatttg gtgtttggaa
tttgtttttg gttttagggt 420tgtggggttt ggaaggaggt gtattgaggt ggtttgaaaa
tgatttaggg gagttgaggt 480gttttttttg ttattttatt tagtgttatg ttttggatgt
ttaagaggtg aagggggtgg 540agggggtggg g
5517551DNAHomo sapiens 7caaatcaaaa tcctaagcat
cctatttcct tccagcaaaa aaaagcaaac tccatccctt 60aatttatttt atagccatag
catccatagc atcagcataa aaaccccaaa gcaccaaaaa 120agcgcgcaag caaaaccccc
gccccttctt aagcctacct taataaaaac atcaagccta 180aaaagcagca aaaacttacc
atcgccccgc ctcaacaaaa aaaaaaacac aaaactaacc 240aaaccaaagc ataaaaaaaa
aagcactaac ccaaaaaaat cacaatcaat atctcgctgc 300aaccaaaaaa tctaacaaca
tcttcctctg ccgcagctgc aaactaacct aagcgccgct 360aaataaaaaa tcagcacgca
aaaactaagc cgcaagcctt aaacaaaagc caaaatccca 420acgccccaaa ccttcctccg
catagctccg ccaagctttt gctaaatccc ctcagctccg 480caaaaaaaac aataaaataa
atcgcaatac aaaacctaca aattctccac ttcccccgcc 540tcccccaccc c
5518551DNAHomo sapiens
8caaatcaaaa tcctaaacat cctatttcct tccaacaaaa aaaaacaaac tccatccctt
60aatttatttt ataaccataa catccataac atcaacataa aaaccccaaa acaccaaaaa
120aacacacaaa caaaaccccc accccttctt aaacctacct taataaaaac atcaaaccta
180aaaaacaaca aaaacttacc atcaccccac ctcaacaaaa aaaaaaacac aaaactaacc
240aaaccaaaac ataaaaaaaa aaacactaac ccaaaaaaat cacaatcaat atctcactac
300aaccaaaaaa tctaacaaca tcttcctcta ccacaactac aaactaacct aaacaccact
360aaataaaaaa tcaacacaca aaaactaaac cacaaacctt aaacaaaaac caaaatccca
420acaccccaaa ccttcctcca cataactcca ccaaactttt actaaatccc ctcaactcca
480caaaaaaaac aataaaataa atcacaatac aaaacctaca aattctccac ttcccccacc
540tcccccaccc c
551925DNAArtificial Sequenceprimer 9ggaaggtcgt ttccctccgt ttgag
251024DNAArtificial Sequenceprimer
10cggctcccct ggatcgtttt cgag
241124DNAArtificial Sequenceprimer 11tggggtttcg tggttttttc gcgc
241225DNAArtificial Sequenceprimer
12ccgcgaatcc aatcaaacgt cgacg
251329DNAArtificial Sequenceprimer 13atttttgggg ttttgtggtt tttttgtgt
291429DNAArtificial Sequenceprimer
14atcaccacaa atccaatcaa acatcaaca
291526DNAArtificial Sequenceprimer 15gcacgactaa aaaataaatc gccgcg
261630DNAArtificial Sequenceprimer
16aaacacacaa ctaaaaaata aatcaccaca
301725DNAArtificial Sequenceprimer 17taaaacctcg taactttccc gcgcg
251825DNAArtificial Sequenceprimer
18gtcgcgagtt tagttagacg tcgac
251928DNAArtificial Sequenceprimer 19tcctaaaacc tcataacttt cccacaca
282029DNAArtificial Sequenceprimer
20agttgttgtg agtttagtta gatgttgat
29211301DNAHomo sapiens 21tagatgagaa gaaaataaga ggaaggatac ggtagagcct
ttagaatggg gttcagtctt 60aggactcgta ggataaagga aggtcgtttc cctccgtttg
aggcagggaa tcaaacaaaa 120caccggcacc gcaggcaccg cagtcgcact cctggggcct
cgtggctttc ccgcgcgccc 180gccttggggg cggggaagaa cccggatgga accacccttg
cagcccggac cccccgccgt 240ctttgaatgg cagcggggcg gagttgccct cccttctgtg
ctctgactgg tttggctccg 300cactctcctc ttcgtgattg ggctttccta gtgccagtca
cagagcgacg ctggtctccc 360agattgttgc agaaggagac ggcgtcgacg tctgactgga
ctcgcggcga cttacctttc 420agtcgtgcgc tcctgatccg gcgctcggaa tttgtccccg
gcttcagggc tgcggggcct 480ggaaggaggc gtatcgaggc ggctcgaaaa cgatccaggg
gagccgaggc gctcctcttg 540tcatcccact cagcgccatg tcctggatgt tcaagaggtg
aagggggcgg agggggtggg 600gcgctcggtc taacggcctg gaggcgtccc caaatgaacc
tgaccttccc ggcgttcctc 660tgcgttccct gggcgatttg tgcagctgta ttcgttctgt
tggtcgcata tgtggccgcc 720ggagaaataa atgcattgtc ttcgctggcg agtaggggct
cctagggcga gtcccgtgtt 780agggacttgg gaaatctctg tcaccgactg tggggggcct
ggggcttaaa gcatctcggc 840cagtgcttta tttaacaaaa catccattag atacccactg
tgcgccaggc attgtgctcg 900gtggagggcc gagtaagaca gtccttgtcc ccaagtggct
cacagcttat ctgtatgcac 960agacgtgcag ataagtattt gcagaggcgc tagtagagat
atgcgccact tgcaaggaaa 1020acgcatcgca aataagcgac tgttcttagg agaatggaaa
ctttttgcaa tgctccagta 1080gactcccctt cacttttcat tgtccaaatt ggtatttatg
gtcattcccg aaacagttac 1140acctaatggg tttggggtta tgcttaaacc ggcgacggaa
ggcgggtacg ggtggcttgg 1200ggccaatttc ccacggtggc gtggtttgga taccagaata
ggtgttctgc tcgcagggag 1260ttacagtgat ggatgctgga tgtattttct ggaaaattat a
1301221301DNAHomo sapiens 22tagatgagaa gaaaataaga
ggaaggatac ggtagagttt ttagaatggg gtttagtttt 60aggattcgta ggataaagga
aggtcgtttt ttttcgtttg aggtagggaa ttaaataaaa 120tatcggtatc gtaggtatcg
tagtcgtatt tttggggttt cgtggttttt tcgcgcgttc 180gttttggggg cggggaagaa
ttcggatgga attatttttg tagttcggat ttttcgtcgt 240ttttgaatgg tagcggggcg
gagttgtttt tttttttgtg ttttgattgg tttggtttcg 300tatttttttt ttcgtgattg
ggttttttta gtgttagtta tagagcgacg ttggtttttt 360agattgttgt agaaggagac
ggcgtcgacg tttgattgga ttcgcggcga tttatttttt 420agtcgtgcgt ttttgattcg
gcgttcggaa tttgttttcg gttttagggt tgcggggttt 480ggaaggaggc gtatcgaggc
ggttcgaaaa cgatttaggg gagtcgaggc gttttttttg 540ttattttatt tagcgttatg
ttttggatgt ttaagaggtg aagggggcgg agggggtggg 600gcgttcggtt taacggtttg
gaggcgtttt taaatgaatt tgattttttc ggcgtttttt 660tgcgtttttt gggcgatttg
tgtagttgta ttcgttttgt tggtcgtata tgtggtcgtc 720ggagaaataa atgtattgtt
ttcgttggcg agtaggggtt tttagggcga gtttcgtgtt 780agggatttgg gaaatttttg
ttatcgattg tggggggttt ggggtttaaa gtatttcggt 840tagtgtttta tttaataaaa
tatttattag atatttattg tgcgttaggt attgtgttcg 900gtggagggtc gagtaagata
gtttttgttt ttaagtggtt tatagtttat ttgtatgtat 960agacgtgtag ataagtattt
gtagaggcgt tagtagagat atgcgttatt tgtaaggaaa 1020acgtatcgta aataagcgat
tgtttttagg agaatggaaa ttttttgtaa tgttttagta 1080gatttttttt tattttttat
tgtttaaatt ggtatttatg gttattttcg aaatagttat 1140atttaatggg tttggggtta
tgtttaaatc ggcgacggaa ggcgggtacg ggtggtttgg 1200ggttaatttt ttacggtggc
gtggtttgga tattagaata ggtgttttgt tcgtagggag 1260ttatagtgat ggatgttgga
tgtatttttt ggaaaattat a 1301231301DNAHomo sapiens
23tagatgagaa gaaaataaga ggaaggatat ggtagagttt ttagaatggg gtttagtttt
60aggatttgta ggataaagga aggttgtttt tttttgtttg aggtagggaa ttaaataaaa
120tattggtatt gtaggtattg tagttgtatt tttggggttt tgtggttttt ttgtgtgttt
180gttttggggg tggggaagaa tttggatgga attatttttg tagtttggat tttttgttgt
240ttttgaatgg tagtggggtg gagttgtttt tttttttgtg ttttgattgg tttggttttg
300tatttttttt tttgtgattg ggttttttta gtgttagtta tagagtgatg ttggtttttt
360agattgttgt agaaggagat ggtgttgatg tttgattgga tttgtggtga tttatttttt
420agttgtgtgt ttttgatttg gtgtttggaa tttgtttttg gttttagggt tgtggggttt
480ggaaggaggt gtattgaggt ggtttgaaaa tgatttaggg gagttgaggt gttttttttg
540ttattttatt tagtgttatg ttttggatgt ttaagaggtg aagggggtgg agggggtggg
600gtgtttggtt taatggtttg gaggtgtttt taaatgaatt tgattttttt ggtgtttttt
660tgtgtttttt gggtgatttg tgtagttgta tttgttttgt tggttgtata tgtggttgtt
720ggagaaataa atgtattgtt tttgttggtg agtaggggtt tttagggtga gttttgtgtt
780agggatttgg gaaatttttg ttattgattg tggggggttt ggggtttaaa gtattttggt
840tagtgtttta tttaataaaa tatttattag atatttattg tgtgttaggt attgtgtttg
900gtggagggtt gagtaagata gtttttgttt ttaagtggtt tatagtttat ttgtatgtat
960agatgtgtag ataagtattt gtagaggtgt tagtagagat atgtgttatt tgtaaggaaa
1020atgtattgta aataagtgat tgtttttagg agaatggaaa ttttttgtaa tgttttagta
1080gatttttttt tattttttat tgtttaaatt ggtatttatg gttatttttg aaatagttat
1140atttaatggg tttggggtta tgtttaaatt ggtgatggaa ggtgggtatg ggtggtttgg
1200ggttaatttt ttatggtggt gtggtttgga tattagaata ggtgttttgt ttgtagggag
1260ttatagtgat ggatgttgga tgtatttttt ggaaaattat a
1301241301DNAHomo sapiens 24atttattttt tttttatttt tttttttatg ctattttgga
aattttattt taagttagaa 60ttttgagcat tttatttttt tttagcaaag ggaggcaaat
tttgtttttt agtttgtttt 120gtggctgtgg cgtttgtggc gttagcgtga ggattttgga
gcattgaaag ggcgcgcggg 180cggaattttt gctttttttt gggcttattt tggtgggaat
gttgggcttg gggggcggca 240gaaatttatt gttgctttgc tttaatggga gggaagatat
gagattgatt aaattgaggc 300gtgagaggag aagcattaat ttgaaaggat tatggttagt
gttttgctgc gattagaggg 360tttaataatg tttttttttg ctgcagctgc agattgattt
gagcgctgct gaatggaaag 420ttagcatgcg aggattaggc tgcgagcttt aaataggggc
tgaagttttg atgctttgga 480tttttttttg catagctttg ctgagctttt gctaggtttt
tttggctttg cgaggagaat 540agtagggtga gttgcggtat aggatttata agttttttat
tttttttgct ttttttattt 600tgcgagctag attgctggat ttttgcaggg gtttatttgg
attggaaggg ctgcaaggag 660atgcaaggga tttgctaaat atgttgatat aagcaagata
attagcgtat atattggcgg 720cttttttatt tatgtaatag aagcgattgc ttatttttga
ggattttgct tagggcataa 780tttttgaatt ttttagagat agtggctgat attttttgga
ttttgaattt tgtagagctg 840gttatgaaat aaattgtttt gtaggtaatt tatgggtgat
atgcggtttg taatatgagc 900tattttttgg cttattttgt taggaatagg ggtttattga
gtgttgaata gatatatgtg 960tttgcatgtt tatttataaa tgtttttgcg attattttta
tatgcggtga atgttttttt 1020tgcgtagcgt ttatttgctg ataagaattt ttttattttt
gaaaaatgtt atgaggttat 1080ttgaggggaa gtgaaaagta ataggtttaa ttataaatat
tagtaagggc tttgttaatg 1140tggattattt aaattttaat atgaatttgg ctgctgcttt
ttgcttatgc ttattgaatt 1200ttggttaaag ggtgctattg cattaaattt atggttttat
ttataagatg agcgtttttt 1260aatgttatta tttatgattt atataaaaga ttttttaata t
1301251301DNAHomo sapiens 25atttattttt tttttatttt
tttttttatg ttattttgga aattttattt taagttagaa 60ttttgagtat tttatttttt
tttagtaaag ggaggtaaat tttgtttttt agtttgtttt 120gtggttgtgg tgtttgtggt
gttagtgtga ggattttgga gtattgaaag ggtgtgtggg 180tggaattttt gttttttttt
gggtttattt tggtgggaat gttgggtttg gggggtggta 240gaaatttatt gttgttttgt
tttaatggga gggaagatat gagattgatt aaattgaggt 300gtgagaggag aagtattaat
ttgaaaggat tatggttagt gttttgttgt gattagaggg 360tttaataatg tttttttttg
ttgtagttgt agattgattt gagtgttgtt gaatggaaag 420ttagtatgtg aggattaggt
tgtgagtttt aaataggggt tgaagttttg atgttttgga 480tttttttttg tatagttttg
ttgagttttt gttaggtttt tttggttttg tgaggagaat 540agtagggtga gttgtggtat
aggatttata agttttttat tttttttgtt ttttttattt 600tgtgagttag attgttggat
ttttgtaggg gtttatttgg attggaaggg ttgtaaggag 660atgtaaggga tttgttaaat
atgttgatat aagtaagata attagtgtat atattggtgg 720tttttttatt tatgtaatag
aagtgattgt ttatttttga ggattttgtt tagggtataa 780tttttgaatt ttttagagat
agtggttgat attttttgga ttttgaattt tgtagagttg 840gttatgaaat aaattgtttt
gtaggtaatt tatgggtgat atgtggtttg taatatgagt 900tattttttgg tttattttgt
taggaatagg ggtttattga gtgttgaata gatatatgtg 960tttgtatgtt tatttataaa
tgtttttgtg attattttta tatgtggtga atgttttttt 1020tgtgtagtgt ttatttgttg
ataagaattt ttttattttt gaaaaatgtt atgaggttat 1080ttgaggggaa gtgaaaagta
ataggtttaa ttataaatat tagtaagggt tttgttaatg 1140tggattattt aaattttaat
atgaatttgg ttgttgtttt ttgtttatgt ttattgaatt 1200ttggttaaag ggtgttattg
tattaaattt atggttttat ttataagatg agtgtttttt 1260aatgttatta tttatgattt
atataaaaga ttttttaata t 13012626DNAArtificial
Sequenceprimer 26acgtcgacgt ctaactaaac tcgcga
262727DNAArtificial Sequenceprimer 27atcgttttcg agtcgtttcg
atacgtt 272831DNAArtificial
Sequenceprimer 28aaaacaacat caacatctaa ctaaactcac a
312931DNAArtificial Sequenceprimer 29ttttggtttt tttggattgt
ttttgagttg t 313026DNAArtificial
Sequenceprimer 30gacgttttta ggtcgttaga tcgagc
263129DNAArtificial Sequenceprimer 31ggggatgttt ttaggttgtt
agattgagt 293225DNAArtificial
Sequenceprimer 32gtcgtgcgtt tttgattcgg cgttc
253325DNAArtificial Sequenceprimer 33gcccaaaaaa cgcaaaaaaa
cgccg 253432DNAArtificial
Sequenceprimer 34ttttttagtt gtgtgttttt gatttggtgt tt
323530DNAArtificial Sequenceprimer 35aaatcaccca aaaaacacaa
aaaaacacca 303628DNAArtificial
Sequenceprimer 36gttctattaa tcgcatatat aaccgccg
283729DNAArtificial Sequenceprimer 37ttggggataa ggattgtttt
atttggttt 293831DNAArtificial
Sequenceprimer 38ttcattctat taatcacata tataaccacc a
313962520DNAHomo sapiens 39aggtgagaac tgggcaaaag ttgtgaagca
gcaattctgt tatatggaca gtgttctgct 60ttttaatcct atttagcttg tttcagaaat
tctcactttt gttgactgcc aacatacaaa 120gtaagggaaa ctcaagatat taagatggct
gtatcagttc ttaaaatctg cagagcctgg 180ttcaaaatca gtcactccct tcagaagcag
acatggcatc tgttccttgc ttgcttgttg 240gttgtgtacc tttcacgaga cctgaatttt
agaattgccc agtgctgcca gagtgagtga 300gtgtaattct cctttcaggt aaagataggc
tatctcaaca ctgctgagtg attcataaac 360atatcaacca atagcattaa cccattttat
ttcctgtcct tagtgtctga agatgctcac 420cagttttctg tgtacagtaa ggcagcatgc
taaaatgctt ttgttcagtt ctgtatattt 480gaaaatagca gtgtgttctc tgatggttac
ctgcagtggc accctgtaca aaaaataaaa 540gacttattgc tgtatcttgg ttgtttaatt
aaattaagga atttcaccat acacccttga 600acaaatctat tagggaattt ttcacaattt
ttggaatttg tcatagtttt aaaaaagtgt 660aaagcttgac attgggatat atgctttaaa
aactggtatc tatgatttca atctaattgt 720ttttctgtga tggtgatgga tctgacagat
cagaacaaac cgagatcaaa cttacatagt 780gtcatccaca ctgcacctct ctattaagtg
ggtatttagc tatttagaat attttaacct 840taaagtcatt cctaactgtc agaatcaata
ctgggtccag tccctactag tattttatct 900atgatttaaa atgatagaat ggaaaagcgt
atttattaaa tttatagatc attctaggtt 960gtaagagatg gcaagctaaa gaagggtagg
atccaagatg accttttaaa actaaaatga 1020gtggtcagaa atgattctat ggaagaaaac
aggacagtta accatgacag gaagaaactt 1080tagaggtttg attgggcaaa agctgcaaag
cttttgtgac ttagccttca aagtcaccct 1140ccttattcat attgatgtga attaatatac
aagcatatag acagcttaag tcaagaatgg 1200ttggggccac cttgaaggct gttaccatag
gagctcaata agaacctgag gatttaccta 1260gaattataaa atattaaatg ataaatgact
tctgaaacta gctatttgga ctggtgaaga 1320atgagtcgct attgatcttc aagtacaatg
aagcatttac caaagattta tttaagtgcc 1380tccatgtgtc agatgctgtg gtacaaagaa
ggacttctca aaattttagc tagtcagagg 1440tctttctggc ttccgagtcc ctggttaaga
tgaacagaaa cacagtcttc agatataaaa 1500tgtcttattt ttgtggccat tcagttgcat
tcaacgttaa ttttttctat ttactaccgt 1560tcattctcta ttttttacac agtagcataa
caaagctcta aggtggaaaa gctgacatag 1620ttttaaattt tttttttttt tttttttttt
cttgaggcag agtcttgctc tgtcagtcag 1680tcaccaggct ggagtgcagt ggtgcagtct
cggctcactg caacctctgc ctcctggatt 1740caagagattc tcctacctca gcctcctgag
tagctgggac tacaggcacg caccaccacg 1800cccagctaaa ttttgtattt ttagtagaga
caggtttcac catgttgcca ggatggtctc 1860aatctcttga cctcgtgatc caccggcctt
ggcctcccaa agtgctggga ttacaggcgt 1920gagccactgc acccagcaaa tttttaaatt
tcaaataagt agtgaaggct attattaact 1980tttggaatca gaaagaatga caagcttacc
ataagacata gcatataatg ctgtcaagtt 2040atttggctag aaaatcactg aactaaacat
tcttttcctt ctatgatcta tgtcttaagg 2100tgaagtatta actaactttt ccatgtaaag
ctatacaaat attggaaaat cttttctagg 2160gagtccagaa tactaagggt tacttagtaa
aatgtataaa aaggcaacag taattcaaat 2220tacaagattt atatttgcag aggtgatcca
tatatactta tccccttgca gtggctggta 2280tgacctttgg ttgtaagaca aacttgccca
caacagaggt caaatccatg cctttggaga 2340ttagctccat ggtggatgga gctatggttt
atgcataaag taaatgtttg tttaccttaa 2400ttctccttat acccatattg tcctgctgta
taacacattt tgcagatatt ttgaagttaa 2460tgtgttaaaa acttgaggtt aaacatttga
gtttttgtta agagccaaac atcaaatgtg 2520cccttatatt tttaatgaat ctcatccaaa
tgctaatgca taaaccttga caagtagtat 2580aaataaaaca agaaaaaaat acagcaatgt
ctttgccatt ccccaaaaca aagcacacac 2640tgccgaagat cattagtact cactggtaac
aaactacata gggttagttt gtatttccaa 2700ttctagagct gtaattttaa ggacaaaatg
tacaatgatt gattaagagt gctatctgtg 2760tatatatagg tattatcaca actccttttt
ttcttccaga tgaagaaatt aattgggacc 2820aatgttttta gatcaaggca ttttaaataa
gcactcttga tttctgaaca agaatttcaa 2880ccagctaaat tgagcaaaat aaagttagtt
aggatatgag gacattattc tgttacagta 2940atcttcatgt actctcaaaa aaatgtaaca
cttgcataga aatgtcacaa ttaatgaagg 3000attttatttg aagataaagt caaaattatg
gcaccgagga aggtaataaa catttgaaat 3060ttttattgat ttttaaattt aaaatccagt
tttaaccaca aaattgtttg aatcacaagt 3120ggtaatacaa tgtcttcaat atttttctaa
agttattttt ctatataata ataagacaac 3180agcatagcat ataggaagtt ttcattccag
tggctttttt atatatttat ccttcttagg 3240aaggacaaat taaatttttt aaattaaact
tttaaaatat aacaacatct aacagaactg 3300tacaaaacaa agagacattt tttaaacaac
ttgccaaact tacttatgag tgtgttttaa 3360aaacaacttt gtaaatgtct gggcaaagaa
gcaagctgtc ctccctttac cttcatagtg 3420agtttgtaag gctttgtctt tgtaagcaga
aagagtagac tgtgttgttt tttgccaaaa 3480actgtttata cttaatctca ctgaagtatt
gctatatgga gaacccatac tctgatcaac 3540ttgatttttt gtgtgtaatg cttgatctac
caggtaactt cccaactgct cctaatgcta 3600gcgggctaat cccacattat tattccacta
tcatccctgc agaaaggtct tggttttgat 3660gaaaatcagc cctttcctta cctgctactg
cctcaaaaag ggaccaggaa gattctagct 3720ggctaattca ctgttccctt tgagcaagaa
aacggcacag ggagaaaagg acttatctgg 3780tgagagattt ggcatatacc ttcaatgtgt
gccctataac acaacattgt ctccgatctc 3840atctttctat caaatgactt ccaacactct
taagtctcag gtattcttaa atctgtatca 3900tcaaacatga agcttctctt gtttgttaga
gtaattaatc tttctttgga ttaaagtttc 3960cctttgaaat aaaaccacct acctaatctg
actgctaaat ttctagcttc tttgttttaa 4020atatgctcag gagtcaaccc aaattctgca
gcaaataagt ttgcttatta acaaaaaagt 4080aaaaaaaaaa gaaaagaaaa aagatgacta
attctacaga tagctgtaag gatgaattac 4140tcaagttcaa aatcaaattc tgattctaaa
cacataacaa ttgtttacat tcaggattaa 4200gatgtcttta agagttgaaa cgactttgga
gatcatccag cccaacttcc atccagatat 4260cacgcctctc acatatagta gtcttctgaa
ttataaaaat ttataaagtt acttccaaaa 4320aaagctacat aaataaaaat tatctattta
tagaaatatc tatttagcag ttccataatt 4380taaaatattc aaatcaaatt gggtaggact
ggtttgcctc tcactcccac agactatatt 4440tatacctcag acacagcaag ttacatttaa
acaatgagtg tagtactact taactaaaat 4500ggaaaaaata gtactcttaa cataatccct
aattttttca tgaacataaa actccaagtc 4560atttatgtga actatatctc aatgtagctg
taggaaaaat aaaaacctgt gctaacctgg 4620actttggtct catttaagat ttggttctgg
aatgcaaata tggtttttga aagcccaata 4680aaattaattc ttgtatagtc tgtatatatt
gtttacaagg actacaaaca ctgcatcaca 4740aatcggaggc tttggtaaat aactaagtgt
ccaacataga aaataactat ttggtcaaaa 4800gtataaaagg tctgacctta tttgaaatac
gaaaaagctg agtacttgga agatacgtga 4860aaatactcag catagatatt atgaaaagct
gaataacaaa gtaacctttt ttctcaaatt 4920atttcaggcc acagtatata acggaactta
ttgctatttg aagtttcatt aaaaataggt 4980tcatatatag aagaaattgt gtcagtaata
cctcttcact aatataaaat atgccccttt 5040tagaagacgt gttctctaga tctcatttct
aaaactctgt atttttctca tttctaaaac 5100ttaagtatcc aatcaaactg accttactaa
aatcccacaa attataagtc aattaatgtt 5160ctgatttcat taattttggc ttgtttcatt
tcgtcagcat ttggtttttt agttccaaag 5220gctcctgctg caagttctct ctttttgttt
tgtattttca gcatattttc ttcaacagag 5280tcctttacaa tgaactttaa aaagaaaaaa
aaaagttaag tagtttttaa ggcatagtat 5340ttaaaatctt tattaaaaaa actaaatgaa
agatttgcct aatggcaaaa accgcaatta 5400cttttgcacc aaactataat aactatttaa
tagaactgaa ctttgctttc cttacccaat 5460atggttttct tatataaaga atatggctgt
acgaatagaa gttacagatc acaagataag 5520caaagcaata aagaaaaaaa aagcaaacta
tataggcaac tctggcaaaa tgacttcagc 5580attgtattaa aaaggccatt ctggccgggc
acggtggctc acacctgtaa tcccagcact 5640ttgggaggcc gaggcgggca gatcatgagg
tcaggagatc gagaccatcc tggctaacac 5700ggtgaaatgc catctctact aaaaaataga
aaaagttagc cgggcatggt ggtgggcgcc 5760tgtagtccca gctactcggg aggctgaggc
aggagaatgg tgtgaacccg ggaggcggag 5820cttgcagtaa gccgacattg cgccactgca
ctccagcctg ggtgacagag cgacgtctca 5880aaaaaaaaaa aaaaaaaaac aaaaaactat
tcccagaaat tcaagaatga atctttagta 5940taattcacaa cactaacaaa gagaaaaata
aaacgtgatc atctcattcc tgataaaaac 6000ttgattttaa aataccccaa ctaggaatga
atgtctcaac cttacatatt taatgtgaaa 6060caataaaagt attctacaac aaaacagaaa
caagtcaaga atggtcacta tctgctacta 6120tttaacattg tttaagaaga cccaattact
gcaataacta aagctatata ataaaatttt 6180actgaaatac atataagatc tacataaatg
gagaaaggat actatgttcc tgtatggaaa 6240cattcgttac ttataaagaa agatagttct
cgaaagtaat aataaattca ataaatctca 6300agatatctta aatgggattt tataactaaa
atctaaagtt caattcaaga gcaataataa 6360taaccaccaa tacaattttc aaaaagggtc
atcagaacat tttaccctat caggtatcaa 6420aacttacttt aggaaaccat agttgttata
agagtaatac tgccctaaga aaaggaaaag 6480agatcaatgg aacagaatag tctagaaaaa
gacccaagta atatatataa attagtttat 6540gatgaaagtg acatttcata ctagtaggga
aaaagttgga attttcaaag aaaaaaaatg 6600atgtgacaga acaaatcgtt cttcatctga
taaaataata agcttagact catgtcacaa 6660atcataaaac aaaaaataaa atcttccaaa
agaaccagaa gtaaatacag aatagttgaa 6720aaaatgtgaa aataaagcct tctgtcctag
gtaagacctt aaaacctaga acgtataaaa 6780aaggctgacg ttttaaatca taaaaatttt
aaagcttttt catgaccaaa aaaatgcagt 6840ttaaagacaa ggaagagtat gggataaacc
attttcaaca tacataaaca aaagatttca 6900agtcagactg tatcagggag tcctttatac
tataaaacta tttgtaccat aactatgaga 6960actggacaga agatataaat aattcacaga
aaaaatgtta ataaacaaga tatgtgaaat 7020atactcaagg ttttaggtaa atgcaaatta
gaacagcact tttttggcca cagaagcata 7080aattaacaat tttcatataa atcctatttt
gaaaccaact cgatagttga taaaaattct 7140aaatccagaa ttctagtttt cttaacttat
cccaaagaat tacttgcatc tatgtgaaaa 7200gatttatata taataatgct cattatggca
ctgtttttat tagcaaaaaa cttaatgggt 7260taaatatatt acgatatatc tatatggttt
tgtagagtag tcaaattcag agaaagaaag 7320tagaatggta gttgccaggg gctgggggga
gaggaatggg aagttattat ttaatggata 7380cacagtttca gtttaagatg atgaaaaagt
tctggagatg gatgattgtg atggctgttg 7440cacaacaaca tgaatgtact taatgcatct
aaactgtata cttcaaaatg gtaaattttg 7500tattttacca ctatttaaaa aaagtgttca
tctaaatgta cttatgaggt agtctccaat 7560gtatagtttt aagggaaaca aatatgttgt
agaataaagt ataattccat ttatctttaa 7620aagaatgttt ttaaggagat gtacacataa
ctataaatgt aggtgcagag aaaggcccct 7680taaacagcat atattaaatt attgatgaag
gaggaaaaag gtgacttata ttttgctctg 7740cataattaca ctcaatcata ataatatatt
catgcattac ttgcttaatt gtaaaaatcg 7800tatcaaccta gtcttaaaat agtttgttta
aaaactcact tttgtgatga taacttcttg 7860cttctgacca agtctatggc atctgtcaaa
gcactgatct tcagcagcag gattccaggc 7920ctaacaagaa catggatgag ttactttact
actgccctga tctttcagtg ttttgcattt 7980gtcctttcat ttcactagta ttcattcatc
tttttctgac tgaaagtttt ctgcctgaag 8040tcagatacta acatcacaaa tactctgggg
aaaaatccac cctcccaatc actatgatga 8100taggcatcct ccccctacta ctacttctaa
tacctaatac aggccccgac atatgtatca 8160gcacccaaaa taagtatttg ttgattaatc
aatttggttt tttttttcat ttttaaaaga 8220atctatgtgt gcttaacagc agaaaaaggt
tatacatcct cttaaatgac ttaatttata 8280tgagaaaaga cacaattctt gggcctattt
agttttctcc cccttttctg gttttctttc 8340actttcctct gttcatgaaa taaaagtata
aaaaaagggt ttctcctggc taacacagtg 8400aaaccccgtc tctactaaaa atagaaaaaa
ttagccggga gaggtggcgg gcgcctgtag 8460tcccagctac tcaggaggct gaggcaggag
aatggcctga accccgggag gcggagcctg 8520cagtgagccg agatcgcgcc actgcactcc
agactgggcg acagcgagac tccgtctcaa 8580aaaaaaaaaa aaaaaaaaaa aaaaaggggg
ttttaagagt cagtaagtcc taaaaattat 8640ttttttgttg atgctatttt ctgtatcagt
atttgacgtt agtttgactt taaagctgag 8700ctaaaaaaag taccattagc taattaacac
agacatgatc atgagaactg ttttaagcta 8760aaaaataaaa tgatgggtcc ggcacagtgg
ctcacgcctg taatcccagc actttgggag 8820gccgaggcag gcagatcacg agttcaggag
atcgagacca tcctggctaa cacggtgaaa 8880ccccatctct actaaaaata caaaaaatta
gccgggcgtg gtggcaggcg cctgtagtcc 8940cagctactcg ggaggctgag gcaggagaat
ggcgtgaacc caggaggcgg agctggcagt 9000gagccgagat cgcgccactg cactccagcc
tgggcgacag agcgagacta catctcaaaa 9060aaaaacaaaa aacaaaaaac acaacaacag
caataaaatg gtgcaacaat gaaaagggaa 9120gaataaccaa aaagatatag acccctggta
agtggttttt aaatagtatc atgaattgtt 9180tggataagga tcattaaatt ttaaattaaa
actatgaggt tttttttttg agatggagtc 9240tcgctctgtc accaggctgg agtgcagtgg
cacgatctca gctcagtgca acctccacct 9300cctgggttca agcgattctc ctgcctcagc
ctcccaagta gctgggatta caggcgcttg 9360ccaccatgcc ccgctaattt ttgtattttt
agtagagacg gggtttcacc atgttggcca 9420agatggtctc aatctcttga cctcatgatc
cgcccacctc ggcctcccac aaagtgctgg 9480gattacaggc atgagcctcc gcgccaggcc
atattacccc aatacttata caatgttcaa 9540tgaaaaggta aaagaaagct gggcacagtg
ccatatgcct gtaatcccag ctactcaaga 9600ggctgaggga ggattgcttg agctcaggag
tttgagacca gcctgggcaa aagaggaaga 9660ccctgtctct taaaaaaaga aaaaaaaatt
taatggtaaa agactttcat gagcaaattt 9720ttattttttt attgaatcat tatgtttctc
ggacctgact tttaagttag aaacaacatt 9780ggaaagcttg atatgttcaa tgaagaacac
tcaacttact ataaaactgc taaaacgttc 9840tgtaaatttt taaaaactgt taacacgttt
agtctctgat caactttttc atactaatta 9900ttttggtatt aacctctttg gtactaagat
tataagcaag aaaaataaaa tttgggattt 9960gttctaaaaa tcagcttcta gaattatcac
tggattaatt aatattcaac aatctatagt 10020tacaaatatt ttttatagta gccataaaat
actgcaaaca aagatttcaa acttcatcca 10080cagtacaaag gcaagaaaat acaacagacc
ggctgggcgc ggtggctcat gcctgtaatc 10140ccagcacttt gggaggctga ggcaggtgga
tcatgaggtc aggagttcaa gaccagcctg 10200gccaacatgg taaaaccccg tctctactaa
aaatgcaaaa attagccacg cgtggtggca 10260ggtgcctgta atcccagcta ctcaggaggc
tgaggcagtg aaccgcttga acccaggagg 10320tggaggttgc agtgagccaa gattgtgcca
ccacactcca gcatgggcga cagagcaaga 10380ctctgtctca aaaaaaaaaa aaaaagaaaa
aagaaaatac aacagaccat accatatcca 10440tgatattcac ccatgaaggc ctaagtatga
tgaccctagt tataatctta ctgagaagga 10500taccgtcagt tgagggagga aaaaggaagc
ctgagtaaac atatatgcat caacaaggta 10560gcctcaccca agtaactaaa atttatatct
cagtagtatt cctacatgta tctcagtaga 10620attcctacat gttaaaaatc ttctgagttt
caggtaacaa gttacaacca caaatcttaa 10680gtctgaatgc acagaaacag cagagggttg
cctttagccc acatagtacc cagtgcaaat 10740tagaaactga attagtccaa ggtcctctca
ggccaaacag cccctttgta cacgtagaaa 10800aacacgctcc tatgtgaatg cacgtccagg
tcacacagct tggtgagtaa agttcaaagt 10860gaattttagc ctctgttcta ccttcatgcc
tgtactacac aaccacaact gactgcacta 10920tattaggcat taaaacgtac tcagaacatt
ccattaatga taatttgttt ctcaaattgt 10980gatcctagac cacctgtacc tcaatcacct
atgatatatg ctaaaaatgc agcttcctgg 11040gtctaatcct agaactccta agtctctacg
ttgaatgcaa aggaaattgc attttaccta 11100attcttatgc acattaaagc ttaagactac
tagaaccgtg atggtaatca ttttgtctat 11160cttgtttata gttagattcc cagtgcctat
aaaaactgac gcaaagcagg atctcattat 11220ttgctggctg aatgaaccac taccctaaaa
agcattacag ttaatattta ttatagaaga 11280aaagtagaat agcatagcat aatagtactt
aaatatttac atggactatg attctggttt 11340tgcctattac taactgtatg atactgggca
cattatttac ttttctgtgc ttcagttgcc 11400tcttcagtaa aacagagcat tttttttctt
ttttttcttt cagagacggg ttcttgctct 11460gtcacccagg ctggagtgca gtgacaggag
catggctcac tgaaacttca acctcctggg 11520cccaagcaat caatctttcc gacttagccc
tgcaagtagc tgggactaca agcacaaacc 11580actacaccag agtaactttt ctggtacttt
tttttgtaga gatggggttt tgccatgttg 11640cctaggctgc tctcaaactc ctgagctcag
gcaatccatt caccttggcc tcccaaagtg 11700ctgggattac aggcatgtgc cacagcacct
ggccaaaaca gagcatttaa taatggtatc 11760tacctcataa ggatgttgtg aggatttcag
taaagtgctt aagagaaagc tcttaatcca 11820catttactaa gtttttgtta tttatttcta
aagtttgcct aagaattttc acagttaatt 11880ataactatga aggtatttac caaatagttt
tttgtcttat ttttttgaaa gaagtactaa 11940gattctgaaa aaatttacag aactacttac
tggatccatt aaaaacactc gagaagctgc 12000agacagattc aaaccaactc cacctgcttt
taaggacaga agcattatag ttggagatcc 12060tgcttcagtg ttttgaaaac actgaattga
ttcaactctt ttcttttggg ccatggaacc 12120atccaaacga gtaaacacaa atccagaggc
tctaaagggg ggaagaaaag agacaagtaa 12180caaacactat tattataaaa ataaaacaaa
gttaagtaac tacaaatctc ttcaataaat 12240tctttcactc atttatttta acaaatgttt
attgaacact tattaatatg ctaggcactg 12300agtcagtcat tttctgataa attcaggact
taatctttaa attcttgtgc tgccaggaag 12360tattctctat tgtgactctg aatatctttc
agataaaatg actaaaactg gcaaaacaaa 12420ttaagttttt ttttttgcta tccaaatcaa
tatgattata taaaatcctt ttactgaaag 12480tgaaaaacac tgtggaactt acttaagtgg
tatttctatt aaagacagga atgttgtaaa 12540ctgagaaaca accaaacttt ttatgttggg
attcttcttt cttaagtcag tcaatgcgtg 12600cattagcgca ttaatctgca aaaaatatta
agatgtccat tagattccat tttaaatcag 12660taaaattagt ctaaaagttt ttatcataaa
gaaaaggtcc tgaaatgtta tttccctaaa 12720atgcaaaaga ctactcattt aaaacaaaaa
attaagactg aaagaacact ctaataatct 12780gaacccattc catttcccct caaattcacc
actaaaaata aaaagccagt ggtcaacaac 12840agaacactaa aatgacagga atgaaagcag
aattacgatt ttgatatact gtgtatattt 12900ccttatataa aggtaaaaca aatactcaaa
tatttgcaca tgagtacttt acctttgaac 12960tggatgtcca ttccatatca gactttttct
cactgtcacg tgctaattct tctggaggac 13020attctaataa attatcttca tgtatatcat
ttctgcataa agggcattta gcatgtggct 13080atataagaaa gaacgaagta tgagcaacac
ttaacagaag aacaaatata aatatgccta 13140aaaattttta aatgtatgct aaaatctgta
tctctcatag ctaacatttt attaaacctt 13200ttaatggatt tcatccattt aagagacatc
tgaacatgct gaactccaga cataatgtgg 13260taagtagaac ggataaagga gggagcaaaa
gtatgtgatc tggactagct agaagcaaga 13320gcaagaaacc aaaaattata acattgtaag
caagtttaaa aaaaaaatct atgaataaca 13380atttattgaa cctaaataca atacagctac
actatttagt actattcttt aaaaaaaaat 13440ccatcttaat ggcttataaa gctattaagc
ttataaagct tataatagct tataaagcta 13500ctaggatctg ttaaatccta tcaagttcat
tccaaagcaa catctgttgt acccgtttga 13560acctgttaat aaaataagat tatatcttaa
ataactatat ataaaacacc attacaatta 13620aataattctg aagctaaaat tatttaaaag
tcaatacagt ggggttactt ttggggaaga 13680gggagagggt aaatggttgg gaggggccac
aaggggacct aggcagcaga tacaggaata 13740tgtttacctc ataattgact gagctattta
attatgaatt tatacatttt ttctacatat 13800gttatatatc aatttaaaat gtttaaaaaa
attctacccc agctgaatga ctgcttaaca 13860gactagtcaa cttcaaaaaa attctaccac
acaaaagaat ctagacctac cttatccaac 13920acaatagcca ataatcacat ggggcaatta
cgtccttgaa atgtggctaa gccaaattga 13980ggtgtgctgt aagtatacac caccaccaaa
ttctgaagat ttagtttttt gtaaaaagca 14040cattaatact ttttatattt tagatatatt
gggttaaata caaatgtatt aaaattagct 14100taatgtttct ttttacttaa gagtatgact
actagaaaac tgaaaattat ctttgtgtct 14160cacattatca atccatttat tgaacagcaa
tgatataaac agcggaatta agatattgac 14220tgtgaacatc tttttatgtt gcaaatgttg
cattagaaga aatatatcac ctattaaaat 14280actggaataa ttaactatca aaggtatatt
attaaatcca agtgcaaaac tccatatact 14340atcttttatt atattaaaga caggtacact
actctatcaa acactgaacc tgtactgagc 14400aaaaaactaa cctgctcatt ctgaatgact
tggcaaatac agggtttaca aaatacatgt 14460gcacaatgtg ttatcacagg aactgttaaa
gaatccaggc aaattgcaca ttcctcatct 14520gaacctgagc tcagaattaa cttcatcttc
cttattaact tctttctcag ttcttcaggt 14580gtatcatttc ctagagaaaa ggctgaaaaa
ttaatttcag agcaaggttt gtaatatgac 14640aagttaatct atcttataag gaaaagtttc
gcttgccagt agggaaagtc tctcatcatt 14700ttctgccata ggcaaaaagc ataatttaaa
tttgaaattt aagaaacatt cagaagccaa 14760aatagtagcc aactcacata atatacctca
tagagaagca tcattataac atgacaataa 14820cataccacca cacaataaaa ggcagatcca
atatgaatct ttcatcaaag ggtatctatc 14880accggaattc ctggaaagac ctcattataa
ttccatttca ttggtaaaaa aattattgga 14940tgtattccta gctgagtctc acactaaaag
caatctccat ttgacagaac aaaataaact 15000aaatgtatgc cacataaaaa aaatttatca
ctcttattta agtatctctg acaaatacaa 15060tgatatgaag caatcaaatt tacctacctg
aggggccatt ggaagacact gcatttgtaa 15120gaaggtaagt atggcaacaa atttgccgca
gtctaagcaa aagacccagg acatctgcat 15180aatgtgccag gacagtccct tcattaaaat
acctagagat taaaatgtca aatattatta 15240agtccgctaa acaataataa cttcctattc
taaaacagca gagtaaaatg gcatttttct 15300tttcttctct agaaattact ccaaaacaag
aatgaaaagc agaaaagcaa ctctaataaa 15360accaagagag atctataaca ataaggcaca
atatatgaga aatggctgcc aattgcagtc 15420taagtcagat aggaagatgc tgagagggat
gactgcagat ctcaaacaga gctgccaaaa 15480cacaattctt taaaatggat ggcatgccct
gaggggaaaa tcaatacctc tcatgtgcag 15540aaacaggaac agggtgcatg cttggtgact
tgagcttcca tggagaaatt tatatcaaaa 15600aaagagggaa ggtaaactga atacagaaac
atacaggtgc actgtatttt caaaaaaaga 15660ctatgggcac agcactctac actccaagca
cccctaaagt ttctgatctc tgtttatgtg 15720tactaataaa acctaaagta actcaacatg
taaccctgag tcttaagtaa attgggttat 15780ttcctgctgg tcggggacat gattgtgacc
tttccctttt gtatatctgg actagcaaat 15840agctagtcca agaagaactc aagatattca
aatataagta ataatcagca gagagcaaag 15900gtaggggaca catggcagtg aaagaaaact
cagcagaaaa atgttgccac agagcaaata 15960aaaattatga ccaaggatac tgtcacaatt
taacaccact caataaaaca attcctccaa 16020aaaaaaaaaa tcttaaagca aagacaaaga
gctttgaaaa aatatggcaa gagaacagga 16080agagatgaaa gatgagctgg tatggctcaa
gaaagtgaaa gaaaacagtg aatcatcaga 16140gaaatgaaag tcacactgaa accaatataa
gccaatataa aaaaagacta gtcactgata 16200aaaacataga aaaggatgta ggaaacagga
ctgagaaaat caaaattaaa caaaaaaatg 16260taaaagatca gaaataaaat gatacctatg
gaacacagat aaaagacatc taacatacac 16320agagagaaat aaaataatgg aacagaaaac
aatatttaaa gaaagctttc caggaaagag 16380aaggggggaa aagcctggaa tctaaagatt
taaataatat cacatgtccc aggggaaagt 16440caacaaagaa gttgctgaac ttcagaggga
aaaaaaaaaa aaaaaaaaaa accaggcttc 16500cccacagaaa cattctatac aaaatacagt
ggtacaatga caacatagat tcttaaggaa 16560atgggtgatc caaaaattat ataaccaggc
catctgatat ttaagtattt taaggcaaca 16620gaagaaatat agcctaacgg ctaagagtac
aaatttttag agctagctgg tcatgaattc 16680aaaacctggc tgtgccaatt taattatgta
accttggaca ggttgcttaa cctatgtctt 16740gagcttcctc agttctaaca gtggcattgt
aaaagtacct acctcataca actactgtta 16800tttaatttaa tacatgtaaa gtatttaaaa
tagtgcctgg actatattaa gtgctataca 16860tgaatctgtt gctagtatta gtactggtag
tattatggca acaaacattt ctgaatattc 16920aagaatgcaa ggaataggca aaataataca
aagattaaac ttacaagctc taaaccaaag 16980tgccagagtt aatctgtcct gtcactcact
agctgtgtga caacagtgta agtcacctaa 17040actgctctgt gcctcagttt cctcacttat
aagatgtgga taattatact gacctaatag 17100gattgctgtg aatacattta aagcactaag
atcaatgtat aacacctagt gagtgctaaa 17160taaatgttag taaatatata gcctattaaa
aagaatactt taaaaataac ttattttctg 17220agaacagttt caataaagtt aatctttttc
aactttttca aatagaataa cctttactaa 17280cactaattat tccctggcat ggtcatatct
tactgatact cagcatttca atttaagaga 17340tctcttttga gtgcctacca tgtgctaaat
attagaagct tccttcctcc tatttcacca 17400gccattgcta tcttcacagt tagatctctt
tcactttact cactcccttg gtgatctcat 17460ggctttaatt accatccaca catagctgac
tttgaactcg actctcctat ccaactaatt 17520atttgatatc tctacttgga aaattcacgg
gcatcctatg atcaatgtgt ccaaaactga 17580gcttctgatc ttagcttaaa aaaatctctg
ctcttcccat agacctgtct tggtaaacag 17640caattccagc cttccactta tgcaggtaaa
aaggcttgaa gttatctttg attcctcttt 17700cacatgccat aacatatcaa ccagcaagtt
ctgttggttc tacgttcaca atttaacaga 17760atctaatcat ttctcactcc ctttatcact
actatggtaa aaaccaccat catctcttgc 17820tttgactgcc actggtctcc ctacttctga
ccttattcct attcaaataa cctaaatctg 17880agcacagcag ccagagtgat cttttaaaaa
tataaatcag attacgtccc tctcctggct 17940caaaaccctt taaggtttcc tgtcataaaa
agtaaaagcc aaagtccttg acatatgcgt 18000tacatacccc cattccaata aatctctgat
atcatgacta ttttctctcg ttcatactat 18060ccagccacac ttgcttctct gttattcttc
aagctcacca ggcatgttcc caactgggag 18120catctacact tgctcttcct tctggctgaa
acacccctta atcccaatct cctttagatc 18180tttacctata tatcatcttc tcagtgaggt
cttccttaat accctactaa aacagctttg 18240gcattcccca tcccctttcg ttgctttatt
ccttaaaact taatcatgtt tttaaattat 18300ctgtttcttc ccattaaaat ataaattcct
cgagtgcagg catttttgtc tgttttattc 18360tgctgtatcc caatgggtat aagtgcctgg
cacacagtgg gcacttaata aatttttgtt 18420gaaagaatat gaaaagaaag aagattttga
actcagaaat ctgctgctta gaacaatctt 18480tcccactctc tctaattcct atcaaacctg
ctttgtacct acctcactga agtttttatg 18540ttttgaagat tcccctttca cagtcctatt
ctgctttcac ttgtctctta ttcattctgt 18600tccctgtggg ctcacattaa acctgatctt
gcttggacca gaatttcata ccttctacca 18660atccttaatc ttcttgttaa acctatttca
tcagttccta tttccagatt tccatacgat 18720tttggaaata gaagcaagca aaatcttctg
gctagcaatt gagaacttgt catttaaagt 18780aagctttaaa ttatttccta acttaaaaaa
ttacttacct ggtgaattat actattattt 18840cattcacttt attgatttgt aaaaaacact
gtaattcata gaagcaaatt atctttcagt 18900atacttcaaa ctattgctaa cacatctaaa
taatgtcatt cagtgaatgg gaaacaaagt 18960aaacaaaaac taatttttaa catggttttc
tttctccata ccttccaata gtggctctgc 19020cttcattttt cacagactga taaatctttc
tctcttcatc tgaaagtgta atgtgctgaa 19080taaatacttt acgttctggt aactccaaaa
caggttttcc tttaattttg cttgtctttg 19140ttcttctaag tgtaatattt ttaattaggg
actgtaaacg cctaatcaga ataaaacaaa 19200taattatgta acttttaatt tctacataaa
ttgatccaag aagaacgaaa cagaacattt 19260taatttgtta ctgttttata gcttctgtaa
aagtgttttt aaaatccctt atttgggatt 19320ttaaccaatc tgataatcac agaagtattc
tacaaaaata aaattattaa aacaaatgta 19380tggaaatttc ccctcaatac ctaacatagt
ttaatctttt tatatcctgt ccttattttc 19440gtaagttctg ttccccattt ttgttccaat
cactaactct tttacttctc aactgtcctg 19500tcaactccat caccgaatag tcaacactca
aattctaagt atctattaag tgccaggcac 19560tgaattaaat gttgtaagaa tacaaatatg
ggtaagatac tacattctgt aatattaaaa 19620tcaaaagtat ataaataagt aattgtataa
agtaatgagt gcccccaaaa taaaaagata 19680atatggtaat tcagaacaca gaaaaatgct
ttacaggttc cacaatcttt acatttcttt 19740ctcagtactg gaaaaaaaga gatgtatcaa
ttcttgtttt ttcccaacat tgaaatataa 19800atacaatttc ctggagttta aatagttggt
tttgaggaat caagtggata gtactgccaa 19860ctaggttagt ggctcttgaa gctttctcac
tataactcac agtaagaaac acattctatc 19920cagcaaaaat atacaaacac atacaaaaga
aacaaatatt tcacaaaata atacttacta 19980tgtatcatgc actccgatat ttactaatct
cttctctatt tcattgtctt tttaaatgct 20040ggctaccacc tttataatga ttttatatcc
cactaatagc tctcaaatct tcagtctata 20100aaccggtgct ttagccaaaa agttgaacca
cattacatct tgcaaaatgg ctatcactga 20160gttttaattt atgttctcaa cttttatcta
tctccaccag aaacaaccac ataaaacttt 20220cctatacaac ttttgctaag gtcatggtac
ctcacatcat tcactatata aagctatgta 20280aggccaggtg tggtagctca tgcctgtaat
cccagcactt tgggaggtcg aggcaagagg 20340ataacctgag gtcagaagtt tgagactagc
ctggccaaca tggcgaaacc ccatctctac 20400taaaaataca aaaattagcg gggcatgctg
gcgcagacct gtaatctcag ctacttggga 20460ggctgagaca ggagaatcgc ttgaacccag
gaggcggagg ttgcagtgag ccaagatcgc 20520gccactgccc tccagcatgg acgacagagc
aagactccat ctcaaataaa taaatacata 20580aataaataaa taaagctata taagatatga
aaacttaatg catattagat cctcaaagga 20640tatacgatgt ttcctgctaa atctgttaat
gctggttttg gggctttggg gcttagcaac 20700acacaaaaaa cagttatgtc tttcctttca
gtttagccac aacaggggaa gggtagttat 20760cgacaattag aaatctttgt ctttccaaac
aagcatcact aggttctgaa agacaaaggt 20820actgagccat aaacactttt tagaaggaaa
agttaagagt caagttaaaa tagtaaagaa 20880agtgccaact ggttcaagct actaacagtt
atttgtaact aatattaatc actgtctgaa 20940agtacttacc taagtcctcc ttcatctccc
attgtgacag gacgctgtat tgttctatgc 21000caccattctc tatcaataaa tggtttaagt
tttaaaaagg aaagaagaga ccacaagtcc 21060tttaaagaat tctggattgg agtacctaga
aataacagga aactgttata actctttaac 21120cagagtatcc agtaagagta tctatttggc
ccaatcagag aagacaacaa ttgaaaatgg 21180gcactctgta ccaggctcat cattatcaat
taaattaaac tatcaagtgt tgactataat 21240catttattaa aaatatacac atctcaagat
acagttttgc tctaattttt agctctcaca 21300tgacacacta gccaagaaaa agactggaaa
attcaacttt ctccttatca agtacatctg 21360cttatttagt ttttaaagca gcagcactag
cagaaggaaa aagcaactag tttaagttta 21420cattcttttc cttaccttct agtaaaggta
ctgtgaatgg caaacaggtt tgaggatggg 21480aaacagaaat caagattatg actggatgac
tagagtgtct actccatttc cactagagtg 21540tctactccat ttccaaggtt caagaataag
tctagtgact gttccataaa ttatcaataa 21600gtcaaatgtt gttatatcac gggcaattat
gaagaacaac tcaaaacatt tgattaaatt 21660atttatatga ttaattttgg ggcagaattt
acacccactt taagtttaca aagttacttt 21720actaacttac ataatgatat attttagaga
tttaaggttt tagtaagttt aatgctatga 21780ttacctgtca aaacccatct tctttctgat
tctaagtcaa gtacagcttt tgtctgctga 21840gcatttggat ttcgtatggc atgtccttca
tccaggatca ctcttagcca ccttatgcta 21900tgtaatggac tatctccttt agtctgaaat
aaatgtttta tatgaattaa aaaacacagg 21960aaagtaaaat agtacttaat atgatttgct
aactagttgt tatcattatt taatatcaaa 22020tttactatgt gctaggtact gctttaagca
ctacattaat tattgaatac tcacaagttt 22080ataaaataag tacttattag cacacctaat
tttacagata agaaaactga agcacaaact 22140gctaagtaag tatgttaccc aaagcaccac
aggtggcaga actagcattc aaacccaagc 22200agcagtctag ctctctgttg agtcactggt
caatacactg aactgttcta cctcggatta 22260aaacatagca atgatataga aaatggcagt
tgtaagaatt acttgaattc aacattactt 22320tcttttatat aaaaggtaat atttttacta
gaagactgaa attttgatag caaggtattt 22380ttcaaaaagt tttaactttt gaacaattgg
aattataaag gtaaaaaaca attagactaa 22440ctggaaaact tatgaactgt aactacaaaa
aaagcatacc ttcaaactga gatattaatc 22500aaatcattgt gatagaaaac aattttccag
aatattaaaa aagaaaataa aattcattta 22560caaattgctt catatactta caatccaaat
ttaaaatctt aacttaaaat gttctaaatg 22620aaaaatattc ctgtctgggt gtggtggcac
acgcctgtaa tcccagcact ttgggaggcc 22680aaggtgggtg gatcacctga ggtcaggagt
tcaagaccag cctgaccaag atggtgaaac 22740ctcatctctg ctgaataaca aaaattagct
gagtgtggtg atgcatgcct gtaatcccag 22800ctacttggga ggctgaggca ggagaatcat
ctgaaaccag gaggcagagg ttgcagtgag 22860ctgagatcac accattatat tccagcctgg
gcaacaagag caaaactcca tcgagaggga 22920gggggagggg aggagtggag aaaagaaaaa
tactcctaaa ttagcatcta aataaaatgc 22980aaccctttca cgcaaaaagt caaaaaaaaa
aaccatttat ttttctatat atcgttaagt 23040atctaaagag aaaaaaaaag tataaagcct
gaattttgga acactctaga aacctgtggc 23100atatttctta cattcaatct ttaaaagatc
tgttaagtat caacaatact tactccatag 23160tcatgagtta aaatattata cgtagtcaaa
acaatatcct gttttgaaag taaggccggt 23220tctctaatac gatcaggacc ataataaaca
taaaaattca agtgtacatc tgattttata 23280tgttgtccaa actggtccta aagaaaaatt
aggaatattt ttaacaatga gcagatttgt 23340gtcagactta atagatgtat aaaaaagtaa
ctgccctggc agtagtgtga caaatccaac 23400tactaaaatt gtatgaaatc aagtttcctt
cagtctagac caaaataata acaataagaa 23460aaatgacaca tacacagtaa taataaatta
tttaaaattt acttggccct accaatcagc 23520aactgcttca tttttaggag gaaaaaccaa
agttattgaa aacaataatg agttttataa 23580aaatatttta attagaaggc cctttaatgc
tgcattagtt agcagactta ataatataaa 23640aattaaacag taaaagaaca acaaccaata
tttccaagga tgtgggcaga gattcaagca 23700ctgtagccac tttctttcac tcatttatta
ttggcctact atttgccagg tatcaaccac 23760aatagtgatg taactaaaaa tgaagaacaa
ataatcatcc ctgactttct tgaggacatg 23820atcagtttta aaaacttatt attatttttt
taatcatgaa aaaaaccaga cctagaagat 23880tttcagcaga aaaaaatagt ctgattaaac
tgaaaagaga agataccagg gagacacagc 23940tgggaagcta tgaggagaga gatgatgaag
tcctaaacca gcaaggtaga agtaggaaca 24000gggtggaaag gacaggtgtg aaaaacagaa
actaagacaa gactgatagg acttggggac 24060taatgaaata tgaaatatga ataataataa
taagactaag atgctctgat acttgaatga 24120ctacctagaa ttgtgtgata aaaatataag
gaagtttaaa aaaaaaaaaa aagaagggaa 24180aagtgggcat ccaagtaaga aagagatgaa
tattttagac ctttatcatt tttggtcact 24240atagaatacc taactgaaga cgtatactag
gaaaatgtaa ataattacct atagatagaa 24300gtgagaggtg tagatatgaa ttttaaagtt
aacaacagac agtatttaca gctgtaagaa 24360tgaatgaaat ttcctaaaga gagcatgtac
agtgagcaaa cgttagggac agagtctaca 24420gaataccggt atgtaaggag aagacttgtt
tttattggga gaaaacacat gatgcctggg 24480gtataattaa aaatattcca ggccaggtgc
agtagctcat gcctgtaatc ccggcacttt 24540gggaggccaa ggcgggtgga tcacttgagg
tcaggagttc gagaccagcc tagccaacat 24600ggcgaaaccc tgcctctact aaaaatacaa
aaattagctg ggcatggtgg tgatggtgat 24660gcgcacctat aattccagct actcgggagg
ctaaggcatg agaatcactt gaacccagga 24720gacagaggtt gcagtgagct gagactgtgc
cactgcactc cagcctgggt gatggaatga 24780gattctttct ttaaaaaagg ccgggtgcag
tggctcacac ctgttaattc caacactggg 24840aggccgaggt gggcggaacg aggtcaggag
ttcgagacca gcctgatcaa catggtgaaa 24900ccccatctct actaaaaaaa caaaaattag
ccgggcatgg tggcatgtgc ctgtaattcc 24960acctactcaa gaggctgagg cagcaggatc
acttgaacct gggaggtgga ggttgcagtg 25020agctgagatc gcgccattgc actccagcct
gggcgacaga gcaagactgt aaaaaaaaaa 25080aaaaaaaaaa aaatcaatcc agcaaagaaa
aaaaggaaag cacatgtgtc agacttttgg 25140taatttttga atcttgatga gtagatggag
atctcattaa aattttttaa aatactcttt 25200ttaaaaaatt atatgttgta tggaaataca
gaaaagacag atttaatagg aaccagaaca 25260aagacagaga aaaagttagg acaacaaata
taaagggtaa aggcctggga tccaaagaat 25320ataaaagttt caagaaaaaa aaccaaggtg
gacttttaat ttgtcatcta tcttcattcc 25380agttctttac aagaaggttt agagatagct
tacaaaataa gacataacaa ataataaatg 25440aataacatat tcaaggcaga ggaaaagggg
agctcattgt acatggcaat attgaagtct 25500gaagatctct gcaaagggag tttctgtagt
gagtggaagc aaaaaccaga ctaccaagga 25560ctgaagaaag tattaaacat gaaataaagg
caaaaatgca gagaggataa aacatgaaaa 25620agtaaagctg aaaatgcaaa gaatttctta
tagaaatttg gtaaagaaga aaaggaaatt 25680agaatccagc atgacaaaga atccaatatg
agaaaactga atcaatgggt ggccttgaaa 25740tgggacagtg cattatttgt aaagggatta
cacttggaac agaagaaaat cagcttttct 25800ggagaataaa gggagagaac tatagttact
ccatctctag acagtgttag gctctactgt 25860atatgtatgt atgtatcatc ctctcaatag
aaaacgaaat aggtgttatt tcatgagatt 25920acttgtctaa atatccagaa caaatatgtg
gtagaagtag gattggtacc caggttctat 25980tcctttacct ggattatttg aagacaaaaa
agggaatata agtcagttct agcctgctct 26040gtagaatgct agatccttta cttgagagcc
aggaagacgg tatcttgttc agattgtgaa 26100attttaattc ttgatcattg aaatcgtttg
gatgttcatc tcctccaaat ctcatgttga 26160aatgtaatct ccagagttgg atgaggggcc
tggaggtgga aggtgtatgg ctcatagcgg 26220tggatccctc ctgagtggct tagtgctatc
cccttggtga tgagtgagtt ctcactctaa 26280gtccacacga gatctggttg tttctaagtt
tgtagcacct ctccactctc tctcttactc 26340ccactctggc aatgtgatgc tgggtcccca
tcaccttctg ccaagagtgt aagcttcctg 26400aggctgtgta agcttcctgt ggcctcacca
gaagcagatg ccagcacgat acttcctgta 26460aagcctgcag tatcatgagc caattaaaac
tcttaaatta tccagcctca gttatttctt 26520tatagcaatg caagaatgga ctaacacaac
catgaatccc taaatgccaa accacctctc 26580attcctttaa attcacagag tgtgcatgat
ctcacttggc aagattttgc cacactttcc 26640tttctgaact ttacattact tgaactttcc
taaatataaa attataatta aatactgatt 26700acccatttta aaaatacatc tctatacccc
aagccctcaa ataatcagtt agatagttta 26760tacctccatt aatactctaa acctcatccc
aaaactgcca ttcaagtacc catgtactat 26820ccttcccata acaaacaaca aattaaagaa
agaacagttt cgttggaata aaaaaatagc 26880aacagaccat taacattcat tcttctggtc
ctaagaggaa aaatgataca cttatcagga 26940caaggtgtaa tttaaaaaat aaataaataa
accaaaaact tatataaaaa aagataattc 27000atgtcacgga aaaaaagact ataaagccaa
gaagaggcca ctaaatttga caattaaact 27060atcacaagta gtcttttcca atgcaattta
gagaactaat ggtacaagcc agattacaat 27120gaattaagga attagtaagt aaaagcacaa
agtagaaatt aacctccaag agtatgacag 27180tgaaggaaag cagaagagta gcaactttta
aagatgcata tcaagagaat gtattttaaa 27240atgaaaaaac ctgagtatgt atcaggaaga
gccagtggga agagactaaa taaacaaggg 27300aaaaaagaac agatgagcaa tgtgggaaaa
gagagaatcc aatgtagaga ctaaatactg 27360gtcttaaaaa gagtaggatc tcttcctcct
ctgacataag aagaatggcg atgagagata 27420aacagataac ttggaagtgg caagaacggt
gagtggtcct tatattcaca gggattccaa 27480gagagtcatt atttcatcat aacaagaagt
ttgaagctca aactaggaag aaaacatatc 27540caggctgcaa ataaacaatt aatcaagagg
ttccattcca atgtcaatgt ttcagaacag 27600aaaagcaatc tggaaaaaaa aaatttctga
tgggtagtgt taaaaggaca ggctctagat 27660tcaaaatgtt ggagactata tacaagatcc
atcatttatt agctggatga ccttaggcaa 27720cttaacctct ctgtgtttca atgtgggcac
ctggaaaatc agggtaacag ttatgagaat 27780taaatgagaa aaatccttaa aagaggttaa
ggacaaacca aataaacagg agttcactaa 27840ctgttgttat tgttataaaa ataagttatt
gttggtatag ctcttagaaa agacagttcc 27900aacagtcaat ctgtaaatgt agaggagtca
aactatacat ctttcccact ttgggggaca 27960aaaatagcaa aggcagaaat acaaaccaaa
tcttatgtta gcaattgatt atgttccttt 28020accttcttct ctactacaca tcttcaaaga
aaaatttctt caattcagca aacataatcc 28080aattcacctg atgattattt gaacttagat
acgaactgat gaccaagtat atttacccca 28140atgaatgact ctttaacaga atactttcta
ccttgtagat aaaatataaa ttatgttaca 28200tttttaaagg gcttaaagac ttacaatcca
gttgcttaac acagaaagcg gacagatgat 28260cagtgttgtt cttggtctct cctcaacatc
agttttcttt gacccctcca ctgcacaagc 28320tcctaaaaaa atgaaccact gatagaacct
ttagctgttt ttctgaaata tatatgtcaa 28380taaggagatg gcaataatgt gcctccatga
aaagataaat taggaacaat taaaatgaag 28440cagatggact tttaacaggg aataaatttt
acctcatcac ttaaaaaaac taaaaatcca 28500catagtgatt atgagaagaa ccaaaggact
atgacagaga gtaggtcaca aggaggaagt 28560ctctaatatt ttctgatagt catagtcatt
cattaagacg aacatttgtt ttatgcagtt 28620ttctgaattt gtgttatatt taatttggta
gccagccaag atggccctca atgatctcta 28680cttcctgata ttcacaatct tgtgtaatct
cctcccatac tccaccagga ctggtctgtg 28740tgaccaacag aatatagcag agaagatagt
atgtcacttc tgagatagat tacaaaagaa 28800gtttccgtat tgggtgtgca agctcactct
catactcatt ctctctctcc aatctttcac 28860tctggtggag ccagctgcca cgatgtgagg
ccactcaggc aacttatgga aggatgcaca 28920tagtgaggat ctgaagacta cagttagcca
gcaagtaatg aggtctgctg acaactatgt 28980gagtgaactt ggaaatggat cctccctaag
tccaatgttc agatgacggc ggcccctgaa 29040gaagagtttg actgcaacct cgtaagagac
cataagccat taaaccatcc agctaagcca 29100ctccgacttc cagccctgtc aaagtgtgtg
agataataaa tgtctgattt taaatggcta 29160agttttagga taatttgtta cacagtaata
ggtaactaat atattacaac aaaaattctt 29220aatacttttc tgtggagcat gtttaaaaat
agtgataata aagtgttttt acaaattctt 29280gaaataaatt ctagagatgg tatcccttac
caatggagta tcttccagta aacttaaaat 29340gcaaatttct gagtttttaa aaaattgtat
tttcacacta atttcaaaat taaactggaa 29400ggtgagcaaa gtatcatact gtatctacgt
atcattcaca acttgctttt ttctctcagt 29460gctatctaca gtagtccctc cttatccact
gggggtatgt tccaagaccc tcaatagata 29520cttaaaacca taaatagtac caaaccctat
atataatgtt ttttccctat atatacctac 29580aataaagttt aatttataaa gtaggtgcag
taagagattc gtaacaacaa agaataaaac 29640agaacaatta taacaatata cagttgaccc
ttaaacaata tgtttgaact atgtgatgca 29700cttacatgca aatttttgtt aaccaaacgt
ggattaaaaa tacagaatgg gccgggcgcg 29760gtggctcacg cctgtaatcc cagcactttg
ggaggccgag gcgggcggat cacgaggtca 29820ggagctcgag accatcctgg ctaacacggt
gaaaccccgt ctctactaaa aatacaaaaa 29880attagccggg ggaggtggcg ggcgcctgta
gtcccagcta ctcgggaggc tgaggcagga 29940gaatggcgtg aaccccaggg ggcggagcct
gcagtgagcc gagattgcgc cactgcactc 30000cagcctgggc gacagcgaga ctccgtctca
aaaaaaaaaa aaaaaaaaaa aaaaaaaata 30060cagaatgggc caggcatggt ggctcatgcc
tgtaatccca gcactttgga agactgaggc 30120gggtggatca tgaggtcagg agatcaagac
catcttggct aacatggtga aacccgtctc 30180tattaaaaat acaaaaaatt agccgggcat
ggtggcaggc gcctgtagtc tcagctactt 30240gggaggctga gacaggagaa ttgcttgaac
ccatgaggtg gaggttgcag tgcgccgtga 30300tcgcaccact gcactctagc ctgggcgaca
gagtgagact ccgtctcaaa acaaacaaac 30360aaacaaacaa aaaaacagta tctgctgatt
gcaaaactca tgtatatgga aggccaactt 30420ttcatatata ctcgtcctgc agggcagact
gtgggacttg agtatgcgca cattttggta 30480tacacagggg gccctggaac caaatccctg
acaaatactg agggatgact gtactgtaat 30540aaaacttatg tgaatgtggt cattctctgt
ctctcaaaat accttattgt atataatatt 30600ttagaactga aactgtagaa cgtgaaactg
cagataaaga acaactgtgt ttgggattta 30660cacttattag tatttgtagc tctagtgtat
tcattttaac caattcatag catcatactg 30720caaagtagat tacaatttat ttaccattcc
cctactggac aacatgtaca ttatttctaa 30780tattccatta aaataatgct gcaatgaaaa
ccattttttc atgtctcctt atttatattt 30840acagagttac agatggtcta tattcagctt
cattatattt ttgtccagtt gctctccttg 30900gtaattggcg ctcccatcaa caatgcacga
attcctattg ctgcacaact ttgtcaatac 30960ttggcattgt gagatttttt aatgtctgaa
aatatgaagg gtaagagcat ccccctattc 31020ctttatttta tgtttttgag acatggtctc
gctctgtcac ccaggctgga atgcagtggc 31080gctattatgg ctcactgcag ccttgacctc
ccaggctcaa gtagctggga ctacaggcat 31140gtgctaccat gcctggctaa ttttgtgtat
ttttttgtag aaacaggatt ttgccatgtt 31200gcccacccct gatcttgaac tcatgggcct
cggcctccca aagtgctgca atttcaggca 31260tgtgccacca tgcctgctgc cactattcct
ttaatttgca tctccccaat aattctgaat 31320cagcatattt tttccagtgt taactggtta
tttgggtttt ctttctatgg actccatatg 31380tacatccttt gactattctc ctattgggtg
gtctttttcc aattgtttgg aaagttctta 31440catagtctag aattgaacct tttgtcagtt
acacttaata cacattatct tcttcagttt 31500tatcattttt tactttgttt atggcacctt
ttgatgcatg gaaatttaaa atgtagtggc 31560atttaatcaa tcttttcctt atgacctttt
gtgtcttact aaagagatca ctatctacca 31620caaggacaat caggattatg gcttctgtct
ccattcacag agttttgttt ctattctgct 31680tctctttcaa agatatactt agtttggtta
aaaccaaaat gacacacagc tgtgactttg 31740aaaatgttct ctcttgtatc acagctatgc
atctgcaact gaaagtggtt tctaatgtgt 31800aatttaattc tatctcaacc gaaatttcca
catttcttta aatttctaaa attagggtaa 31860agaataatcc acccaacatg cctatatttt
ctattcctaa ttatctgatt cttggaccaa 31920aagtcggctc tttctcatga gtctatgtta
gcttcataaa caagcccctt caggtcaatc 31980tttacttcat tatcctcctt tttcttcaca
ttacttctca ttatttaaaa ttatattgac 32040ttgcttattg tatatttctc tcatcattac
atcatatgct atatgaaaac agatagcctg 32100ctatatcccc aacatgtaaa ataatggaga
ccaaggggca gagatataat ttgtctcaga 32160tcttctcaac cttagttcaa tctcttccca
tctcaaatat ttagtctctt aagcagtgtt 32220gttttcctat gaattacaaa atataaaatc
ataacaaaga tcactaacat tttaaaatga 32280aaagcaaatt ctaataccaa aaaattaaac
ctttaactta aattaaccaa aattaaatta 32340agctttaaag cagccacaaa ttgctttgtg
ccatcactag aagaaatatc tctgctattc 32400agaggagtaa gggcccccat cacattttgg
ccctaactaa aacaaataca gtaggaaaat 32460gcaccaaaag gaaaacaaat tatgaggtag
ctttgttctt ttaatgggtt atatacttta 32520aaaaatagtc tctattttct aaacttcttg
taatatggtt ttataactta aaattttaaa 32580ttttttcatt caaaaacaga aaaaaaacca
aaaactagaa aacaaggata tttactgacc 32640ctttttcaac attttctttt ttgttgtagg
aacagatgaa gttaatgcac atgcaaatgc 32700cacatcttct ataaccttag aagatcctgc
tgataaaaca acaaagaatc ttaaattttt 32760ttcaaattat ctcctgtgct agagtacagg
tactttctaa gggtaagaaa tagaatatca 32820ttttcaccct ctattgtaag agcttatatt
attaagttaa aatatttggt aaccaaaatt 32880actacctctg acactaagga tatcttttca
actcagagaa acgatgtagt cattcatggt 32940aagacttgag gagaagaaat gaaagtagga
gcagtcatat gcaaagtaat tcggctctca 33000ggtggaagga gaatttttca aggaaacaaa
cagtttgatt cctaatgata aggctgccaa 33060attttcagat cagtcaagat gatcagttat
tttccaaatg caatacaatt tctcaacgaa 33120tgagaaatat agtatgatgt gaattacagg
aaaataaaac atattttaaa gaaaaagaaa 33180aaatttgatt attttcacat ctgtataata
atgtaaacaa taatagtatt gccaggaaga 33240cattctcaac ctaattttgt aattactatt
catactacac acataggttt tattgaagtc 33300attttatagg tagtagaaaa ttgttttaag
tatagtttaa atattattat cccttagatt 33360ttaatttgca actatttatt cacttttcct
cctttacaaa attaatttaa aacccatggt 33420tagctccatg aagagttcaa atgtaaacat
cagccaagtt taagagctat tactaaaaca 33480ttccaaccta gcgagataca aatcaatgga
gctgtacttt aataatcttg aagtctagat 33540tatgggtata tagtatacac atacctttcg
ccctgccttt agtttcagac tgtacatttt 33600tcagtttgcc taaaaataaa acaaaaataa
acataaaaat gggcaaaaca ggacattaga 33660atataggaca cattatcaaa tcaaatctat
tacatgttac ctttcatttt ctgcggcaat 33720tcacttgttt caatttcctc tgaatcactg
ctttctatgt actggacagc agtttttctt 33780ctaaaattaa gtatacacaa agaaattttt
ggaccaaccc aaagcaaaat aagatataca 33840aacatgttta ataaatgcaa taaaaaactt
ttgtcctact catttatttt ttgaaaaatt 33900ttttaaagat tttgacaaat gaaagtcaga
tctgctaaaa gcattctgct ttcataacca 33960gatagatgca ttaactatac aaaatggagt
accaccgtta ttgttttttg agacagggtc 34020tcactctttt gcccaggatg gagtacagtg
acacaatcag aagtcactgc aacctcatac 34080tcctgggctc aagtgatcat cccgccttag
cctcgggagt acctaggact acaggtgcgt 34140gccactacac ccagcgaatt tcactattat
taaaataaat ttgatagccg ggcgtggtgg 34200tgggtgcctg tagtcccagc tactcgggag
gctgaggcag gagaatgacg tgaagccagg 34260agacggagtt tccagtgagc ggagattgcg
ccactgcaca ccagcccgcc tgacagagcg 34320acactccatc tcaaaaaaat aaaataaaat
aaaataaaat aaaataaatt tgaaaaacta 34380agtcattcta aaaaattaaa aatttcaata
ccaggtttaa tatttggtag ttcaattaaa 34440gcatataaaa ctggaacaaa aaagaattgg
ctcgtggcca ggcacggtgg ctcatgtctg 34500caatcccagc actttgggag gcggaggtgg
gcagatcacc tgaggtcagg agtttgagac 34560cagactggct aacatggtga aaccccgttt
ctactaaaaa tacaaaaact agccaggtgt 34620ggtggtgcgt gcctgtagtt ctagctgctc
aggaggctga ggcaggagaa ttgcttgaac 34680ccgggaggca gaggttgcag tgagccgaga
tcatgccatt gcactccagc ctgggcgaca 34740ggagcaaaac tctgtctcaa aaaaaaaaaa
aaagaaaaga aaagaaaaaa gaattggctc 34800ttatcaaaga aaatacttct aacacaatga
aataagcagc caaaaggcaa aaatgtaatt 34860ctttcacaat aggaaatttc ataaattttg
tgtagaaagt ttaacagtga ttaagtgcct 34920aagagcaata aattctgaag tcagactgcc
tagctttaaa tcctcattcc accagtaatt 34980tgttatgtga ttttaagtaa attacctaat
attcttcaat gcctgtttcc tcataagaaa 35040aatagaaata aaaatataat cctcatagaa
taaaatgtta ctacatgcaa aagcccctaa 35100aagagtgcct gttacacaat aacataacta
ttagctataa catgtgtgtc agctgttgtt 35160aagtttttct catgtcctca ccattattat
ttctatgtcc tttaatctga tcttgtccaa 35220ttaacagtgt tgtgttaccc aggtttatga
taagttcaaa gttgtaacat ttgtcattta 35280tctaaatctg aagcaattag atttagataa
cactgatgag tgctggcttt gtgcttagtt 35340cagacagcta ttaatgacag aacaattaac
atagccagtt tcaaaacctt tgttgatgaa 35400atagctagca gagacaaatt tgcgatgact
tgtgtagata ctgatggtag acaatacaac 35460aaaagcccaa tttttagacc tacttcagaa
gataaatgaa gtttgtaaag ttaacagatt 35520caagacaaga tcagagggga attacatctt
ttattcaaat tctccctcta gtctatgtta 35580atataatact gtccacaaat aactttatca
tgtcttattt ccactacatg tgaagaactg 35640aaacgatcaa gaatgctttt gtcaagttat
ctaataaaat attcttcagt tttcatcccc 35700ttaaatccat aaagaaaact tctaaaatct
caaaaaaaag caaaaatccc aagatctaga 35760tcttagttcc ttgacaaaat agaaaaaaag
tagtagaaag atgagtgatt attaaaattt 35820accaattcat ttcttttata ggtcagtatc
attcttcttc atctgtaaaa tgagaatact 35880ctatctgcct aagtcatagt gatcttggta
gtttcaaata aaagaatgta tttgaaagaa 35940ctacttaaca aatataaggg tcatggtcat
ctctatctct ctctttttct tttttttttg 36000agacggagtc tcactctgtc acccaggctg
cagtgcagtg gtgcgatctt aactcactgc 36060aagctctgcc tcccgggttc acgccattct
cctgcctcag cctcctgggt aactgggact 36120acaggcgccc acgaccacgc ctggctaatt
tttgtctatt tttagtagag acggggtttc 36180actgtgttag ccaggatggt ctcaatctcc
tgacctcatg atccgcccgc ctcggcctcc 36240caaagtgctg ggattacagg tgtgagccac
cgtgcccagc ctctatctct tttacacata 36300agaaaacaga gtctaaataa acatatgaca
taatcaaacc ataatagttt accttttggg 36360gcgggagcta gacaattctg acatgcgaaa
cttactcttc tccttgatat ctgaaatact 36420gggttgttca ctacatctag atgcgtctat
ttcaaagaaa aatgcaaata taaagtatta 36480gtaaggtgtc ttagaaacta gtattatccc
tcttaaaacc ttggttttct aattactgat 36540tttcagtttt cttaagatct tcattccttt
cattactcta ttttctctac ttttttccta 36600atctagtatt tcctcatttc taattagaaa
gtaccttcaa atttttatga cccagtcttt 36660tctttgtact catccagccc atccccacac
tagctgaatg tcttgctgta ccaaggcagc 36720taagagctat tggaaaaaaa aatcaacaga
accacaaata ggtgctgtta taaatttaca 36780tgttcaaatc tcagatgtca ccttgtgtca
cttggtaact ctttttacta aacactgttc 36840ggctccccta cccatttatc agaatgatcc
ataccagaat gatccatact acattgaccc 36900ctttgcctag attataactc cttttattcc
aagatgacct atttcacttt acaaagaaaa 36960tgagaaggta tcttccataa aaatctcaat
tttcctatct tttaatccac tcatttatct 37020atactttcct tcaaccttac ttctttcctt
ctactgctaa ctcatctacc taactcatct 37080aggtagatga gttagaccac agaacaaaat
caaaaacaaa aaaaacctca tcttacccta 37140actcgccacc tttctatttc tttcctgttc
caaacttaag taatagcacc aattgttatt 37200tacaattttt cacctactac tcttaaaata
ttttcagtct gctctaatct atctttagaa 37260ataagcttta tgagggtcat ttaagtaaaa
tccacagatc tctttccatt tctcctatga 37320cttcagaatt tggctctaaa gctagctttc
tagcattatc tgctgttatc catgcatatt 37380acattttaaa tacaacaaaa ttcactgagg
tcccagaata aaacatgctc tttttaaagt 37440accctttcat gagatttttt ttcacaaccc
aaaatgctct tccccagttc tctaaatgtt 37500atactaccca attcaatcat ctccttcatg
aaacctttcc tatcagtact cagggaagac 37560cttttaaaaa agagatccta aatttttgta
cagtacaaag ccaaaactag aattacaata 37620ttaggctgaa aaaacgcaaa tcaaatattg
ggtcatattc aaattaaagt ttaccagatc 37680aaatattgga tcatttcaaa attagcattt
acctttgctt agtccatctg ccttttcact 37740ggtattgttt cctccaagtt tcatagagtc
atcgttaaca ttatattcct gggtaaatag 37800gcatatttct taaacagtac tgctatcagt
tttaaataac tttaaatgca tcctgtttta 37860tattgcttta ctttgcataa taaaaacgac
ttaaattgct acccaattct ggagcaactg 37920aaggggaggg tgaataagtg aagaggaaaa
tatacaaact gatttaaaat tttttcacgg 37980gttatctcaa aaaaaaaaaa ataaataaaa
ctacttggcc ttcacctttt cgaaaggaat 38040accgaaagca aatctcaatt ataaatgaca
cattattgtg agaaaaattc aagaactggg 38100atttacttta taattaagaa ttctcagact
tcaaactttt caaaccaatg ttccatgttc 38160agatctttaa tgaggctgcc attgtcatcc
tctctttctc tagaatttgc agggcaaata 38220actctaatat ataatagtac aggtcccaga
gtgtcctctc ttcttactcc atggggtatt 38280ttcctgctag cccattttct tttcttttcc
aatgtacctt tgtttccaaa ggcaccaatg 38340tacatgaatt gaaaacaatt ttataaaatt
tgaattgtaa gcacttaaaa aatcccccct 38400tagagttaag tcttactaaa aatgtggttg
aaaaggttca tcctctttaa aggtaccaat 38460caaaaaaaac taaaaatctg ccgggcgcag
tggctcatgc cagcactttg ggagaccgag 38520gcgggcggat cacctaagat caggagctcg
agactagcct ggccaacaca gcaaaaccct 38580gtctctacta aatagacaca gattagccag
gcatggtggc gggcacctgt aatcccagct 38640acttgggagg ctgaggcagg agaattgctt
gaactgagga ggtggaggtt gtagtgagcc 38700aaggttgcgc cactgcactc cctctgtctc
aaaaaataaa attaaattaa tttaaaatcc 38760caaatacaat atttttagag atacgtaact
agaagagtta ctttagacag ttaatgtcta 38820agcattagct ccaagaaaga gatggtaaaa
atgcttttca tcaaacatcc tatcaagtct 38880acaataatta ctataaccat ttatattaaa
cagtatgcaa aaaaactaaa tgtttattca 38940aacataaagg atgtcctttg cattcatctt
cttttttttt tttctgagac agagtttcac 39000tcttgttgcc caggctggag tgcaacggca
caatctcggc tcattgaaac ctctacctcc 39060tgagttcaag tgattctcct gcctcagcct
cccaagtaac tgagattaca ggtatgcgcc 39120accacaccca gctaattttg tattttcagt
agagacaggg cttcatcatg ttggtcagac 39180tggtcttgaa ctcctaacct caggtgatct
gcccgcctca gcctcccaaa gtgctgggat 39240tacaggaatg agtcactatg cccagctttg
cattattctt tatattaaaa tattttgctc 39300agcaaacttt ttctctataa agttcaaaga
taaacttttt atcaagtcca agctggatga 39360aataatagta tgttagaatc taggggaatt
taaatcaata tttcagtcta cttatgtttg 39420gaatatctaa acatacttga aagagtgaca
attttttttt tttttttaat aaatccctgg 39480ctggaaatca catcttcagc tgccatttca
attaatgtag caacccaaat attctaaaac 39540ctttctacta taaaacacac tttaaatata
taacttatct tggggaaaaa taagagaaat 39600ccttattgtt aattgataat ttctcaatta
ataattgaga ctaaagcagt aagtagtctt 39660atgtgcttct atcaaatctg gtaacctaaa
acttcagttt caataaccaa atgtaggaag 39720aacaagacat aaaacattag gttgctcagg
gtgaggaatt gaacctgaga cttcaacagg 39780tttaactctc aataaaagta gaaagtagga
agtaaaacta agtgcagaca ttaataaaat 39840aaaatctcaa tagcactgac aggaaactga
tgtactgcct tctctgaaaa ctaaaaacta 39900ggaagaatgg taattcagaa gagttcagaa
aagaagtata ttgcatacat ctttttgcta 39960tttccttcta tttaagttac ttgaattttc
aggtaagaaa agaagtaact gttcctctct 40020agtaacattt atttttctca ctaaactaat
ttttaattct tagaaaccaa atataaatgt 40080atgtttatat tatatacata tatatatgta
tatatataca cacacacaca caaattttca 40140agaaaagtca cccttattct catcactagg
acataagtta ttattaacat tttggtacag 40200atagtgtagt gaatggactt aaaacacaca
tgaatatatt atttacaaag tgaggaacca 40260aactccacac aaagatttgt acctgtcatt
ttttaactaa aaagaatcat gaaagtgatt 40320attcatcaaa aataatactt aatggttaca
taatagtaag ttttatggct atatcgttta 40380actgttccct tgtcaaaaga tgtttcaaat
tgtccacctt tattttattt ttatttgttt 40440tagagacagg gtctcacagt gtcactgaag
gtggagtgca atagtgcaat catagctgac 40500cataggctca aactcctggg ctccagcaat
cctcctgctt cagcctccca agtaggtgga 40560actacaggtg ccagtcacta tgcccagcct
ttaaaacaat gttgagggaa aaaaaaataa 40620taataataaa aaaaaacctg tattttcttt
ggaaaaaatg aatttctggt aactcctttc 40680agatagtttc ttagaaatgg cattattgac
tggcaaaatt gttcaaagat tataaacaat 40740tttaagactc ctgatacaaa ttatcaattg
ttctcaagaa tgtgatacct gctggtggct 40800cacacctata gccccagcac tttgggaggc
tgaggcaggt agatggcttg agctcatgag 40860ttcgagacca gcctggacaa catggcgaaa
tcccacgact tgagcctggg aagcagaggt 40920tacagtgagc tgagattgcg ccactgcact
ccagcctggg caacagagcc acaccttgtc 40980tgaaaacgta aaaagaatgt gacacgaact
ttcattctta tcagcaacac ataaaagaca 41040gagagaccat tcatcgcatt tctgaaaaca
caatttcttg gtagtttttt ttaatgatga 41100aatttaaaga tttcacaata aactggaggt
ttcacataaa gcgcacttac tactacttta 41160ccttcttcag tagattcttt ttaactcttt
caataggaag aggtctgcca tcatggaagt 41220tggtaaggat tactgcaatg gccgtaagag
ttttaccctt aaaaatgttt taaaaagata 41280aatggtcaga ttgtgaaacc cagttcacca
gaaaaacgtt cccattttaa aatcaagtaa 41340tcgaatatca aatatcattc aaggtcacaa
aagtgtactg tatattaatg aaaaatgagt 41400atttcattta gttagaaaat gctttccccc
actgcattat tagggaatta ttaaatgatt 41460tacactataa aatactacaa gtatctctga
gaagtgtcat tttattgaat gactgctatt 41520ctacaattct acatttaatc ctgaactctt
gatttttatc atgcaactat ttataactac 41580tcatttgcca tgaagttcgg agctaaagtg
attagtaatt ccagtatgca tataatcatt 41640ttattaaaat ttgacttaaa gataatacca
agcttaaaag tcataacttc tccataaagc 41700ataaaagata tatttcttat gcatttttct
taagtggaag tatgtaattt ttcaatattt 41760actgcagctt aattcatagt tggcaataca
ttagtaaatc tctggcaatt ttaaagcagt 41820gtaaataaca gcagttttca aagtgtggtc
cacaggccac tggggatggg gagggtttcc 41880caggagccct tcacgaaaat gcaaagtcaa
aactgttttt ataatactac taagacagac 41940cttactctgt taacatctgc actaatccta
caaaaagccc aaatagcaat ggtcaaaaaa 42000gtgcctcagc ataaaacaaa gcagttggca
ctaaggtata ttagtactag tcatcatatt 42060cttaactgcc acatacagtt aaaaaaaaaa
gttttcctta agaatgttct tggctgggca 42120cagtggctca ggcctgtaat cccagcactt
tgggaggccg aagcaggtgg atcacctgag 42180gtaggagttc aagaccagcc tggaccaaca
cggtgaaacc ccgtctctac taaatacaaa 42240aaattagcta ggtgtggtgg tgcatgcctg
taatcccagc tacttgggag gctaaggaag 42300gagaatcact tgaacctggg aggcggaggg
tgcagtgagc caagactgaa ccattactcc 42360agcctgggca acaagagtga aactccatct
caagaaaaaa aaaaaaaaag aagaagaatg 42420ttcttgacga agcagtacac attaatttta
ttaagtctca ttacttcagt acaaatcttt 42480ttaatattct gtgtagtgaa atggaaacta
catattaact acttctgttg catatcaaac 42540tatgagggtt agctagagaa aaatcactca
agatgggagt tataagctca actagtcact 42600tttttcatgc gacaccattt ttacttgaat
gacaactatc aatcacgact gagtgtctga 42660aagatacttt ctagaaaatg aacaaagtga
ggctgtcatc tacaaaggaa acaactgagc 42720ttgtcaatga taaaatgtga gttttcaata
gaaaattaga atgttggaaa agttgtgtgt 42780gacactatga acatagtgcc ttcccaaaat
gtaaatgatt ttctaataac atcagtggta 42840atattaatga atgtaagttt tggatattgc
atatgaaata catcaacatt tagtaggtct 42900gcccaactca atgaaccaac attttctaaa
tgagcaatgc ctggtgttat aaaatcatac 42960atgggtaaaa tattcattca atgagcaaga
tacacaaata gattttaaca aaacagggta 43020taaaaagctc actgatatga tttagcgttc
catattacaa caattaacct ttaataaact 43080aatatttatt gaattttgat gcagtatatc
aaagaaatct gcaattatct aaaaagctat 43140tacaatactc ctctctttac agcaacatac
cgtcgtaagg acaaattttc ttcatatact 43200tcaaccaaaa caacctaaca gaacagactg
aatgaagaag caggtgagaa ttcagctgcc 43260atctattcgg ccaaacacag agatgtgcaa
aaatgtgaca atgccagttg ttgcaacaaa 43320aatgttaatg ttttggcata gttatttttc
ttttaaaaat atgctattta tgttaacaag 43380caatgagttt gtactgctat ctttagctac
cactgcctaa tactataact actgataaca 43440gcacacaaaa gcaaaagctc tttgaaatca
gtaattttta tgagtagaaa gggttactga 43500gactgaaaaa tataaaaact gctagtgtaa
ggacacataa gtatctattt aatactctgc 43560ataacacaat atttgtgatg gaatacaatt
tcacataaca ttgttttgtc tttacttttt 43620aaaagaaaac ctatgaaaga caggtcaccc
tgaatttcaa agagcaaaga aaattgagaa 43680acatggccag gcagggtggc tcacatctgt
aaccccagca ctttgggagg ccaaggcggg 43740cggatcactt gaagtcagga gttcaaaacc
agcctggcca acgtggtaaa accctgtctc 43800tactaaaaat acaaaaatta gctgggcatg
gtggcacatg cctgtaatcc cagctatttg 43860gaagtctgag acaggagaac tgcttgaacc
tgggaggcgg aggctgcagt gagccaagat 43920cgcgtcactg cactccagcc tgggtgacag
agcaagactc catctcaaaa aaaaaaaaaa 43980gaaaactgag gaacacatct ttataccact
attttaacta aaaactacag gaaaaactgt 44040gttctacaaa gatagtaaga tctagtccca
aactggtacc aacatattcc tatctctttt 44100tacttcaaaa ttactttcta gaaaatcata
atcacaaaat tagattttat taactaaaga 44160aaaaaataat taccaaaccc atatcatcag
ctaaaattcc tccatggaca ttttctggtc 44220ggtccttctc agaaaaattt gttattgtgt
tatagtataa gtcatttcgc tgttcccaga 44280atggtggaag ttctttgcta ttttcccgtg
acaccatcca agctagagct tgtttttgat 44340gtggaagcag tggtgtttca atagcctata
aataaaaagt cataaagcga aatacattaa 44400gccattcctt tttcacatgc agatcctgtg
attaaacaaa aagtagattc taagtatatt 44460taaacattct gagtagtctt aaataaaaaa
tattggttca atacctcagc tggttccatt 44520tcatgggttt tatcatcttc ttttaaatct
tcaaacaatt tgtcaaattc tgttttaagc 44580tacaataaac agcaacaaga aacaatgtaa
aacaaactaa ttaaaataat acttgcattt 44640aagtcagatt tgaaatcatt taattaactg
gcgttaatgt caaacgggcc aggagtggtg 44700gctcatgcca ataatctcag cactttggga
gaccaaggtg ggaggatcac ttgagctcag 44760gagtttgaga ccagcctgga taacatagtg
agagatccca tctctacaaa aaaattttta 44820aaaaattagc caggcatggg ggtgcatgcc
tgtagttctg gctcctaggg atgctgagat 44880gggaggatca cttcagccta gtaggtagag
gatagagtga accatgatta tgccactgca 44940ctccagcctg ggcaagaaag tgagaccctg
tcccaaaaaa ttaagaaatt gaaattaaat 45000taaatttaaa aaccacaaac tagtcaaacg
ccaaaaaggc acataaatca actctagaat 45060tacactactc actgaagagc gaatgaacag
gtgctttagt tgttaggagg taaaagaata 45120taaaagaatt ttaagtcccg agtttcagtc
acagttttat cacaaataag cttatatgta 45180agtttcagca agacttgacc tttatggccc
catgtcttat tcctataaaa tgaaagggat 45240gaactaaatt tagattgtct ctaaggttcc
ttctgccatt aaaattctgt gatgctatca 45300gttgtattca aaatattttc tcagtcctac
caggtcaatg acattccatt tattattact 45360gcaataagga acttgccatg cactatagga
tacttttatt attatgctaa cttaattgac 45420tatttaaact aggttagtcc aattttatat
caacatatat gtgatacaga tatttttgaa 45480aattaacaca tgaacacagc agcagctcag
aagaaatcag gccaggtcta aaaaatcaga 45540agtaaccggg ggcttatcta ccagaaacaa
gctcaagtcc ttctaacaat aagcagggta 45600aatatggtaa ctacaattag tagataaaag
gactaacact ggtagataca aacatcaaaa 45660aagttttaat gttattttta aattgatctg
cattaattac tttatatact tttttttcaa 45720agaaattcag cataactcat cctaaattac
aaattagaca atgtgaagaa tttggagtta 45780aatctttagc actgcatttg caaaattact
gaaaggagtt aagtctcagc ttcaattttt 45840ttcttaaaat tactacttga acatttaaaa
catttcagat acaaaaatat gccgtgcata 45900aacaggaacg tgtgtgtttt gtatgttggg
tacaaatgtt cacgtatgga tttattcctt 45960atcagtactt tttatataaa atgttaacat
atatatattc tatttattct ataacatgtt 46020tcattataac ttaatatatt tgaagaccac
aaatacccac acgtacatat tcatcctgtt 46080cttttaaata caaacctgca acatttaaaa
ttcactttta gattacaaaa taaaagagaa 46140gcacacctgt tcagttgtca tctgtactgc
agcatgcact ggcatactat agcttggtcc 46200agctcttcca gagccccaac cactttccaa
attgaatcct aaagctataa tttacaaaat 46260aaaaagaata aagccatcaa ataaagtagg
tttttatctt ttattaagta gcttttaaaa 46320tatgcattaa cagggctggg tgctgtggct
catgcctgta acccaagcac tttgggaggc 46380caaggcaggt ggtttgcctg agtccagtag
tttgagacta gcctgggcaa catggtgaga 46440ttctgtctct acaaaaaata caaaaattag
cagagcgtga tggtgtgcgc ctgtagtccc 46500agctactcag gaggctcagg tggaaggact
gctggagccc aggaggtcaa ggttgcactc 46560agccactgat cgtggccact gtactctagc
ctgggtgaca gagcaagact ctgtctcaaa 46620atatatacac aaataaataa ataaaacgtg
cataacaaac tgcattaaaa gccttgtatt 46680ttagaggtat tttaaaagct aaatactaaa
tcactaaaaa aacaagtaat tccacattat 46740ttgaacaaac tcttgccaag caatcctttt
cattcgctat cagaagatta tttttgccta 46800aaatttatca aaaagaatgt ttaaagtcag
gggtttccag atgttaccat tagggaaaat 46860taggggaggg tatatatgag acttcattgc
atatatactt tttctttaat ttttttaata 46920tcccctcact cttactgtgt atatttttat
gtgtgaatct atttcaatag aggcaaaagg 46980aaaaatcaga gatttcctca aaaagactcc
tgaaaaaaag ttgtaccttg gagccttgag 47040cacaaattag ccaacaaact ccaagagaga
ataaatgttt cataatccct gcatcttaca 47100atatcagaat aatattaagt gaaaccctaa
acttactttt tggtgcagga cccaatttaa 47160atccatgttt cttcaactga tctgaaaccg
cttttctatt ttcttctttt ccccaaaaag 47220tcatatgcag aggcatggta aaagcattgt
ttgcaccaaa aggaactacc ctattatatt 47280tgggagaaaa agaaagggaa atcagaagca
ttacttttta tccccactaa gaattagttc 47340aatattttaa gtgtatcatt tacattttac
attgaagtaa agtaataagg aactagttgg 47400gcagacttga aagaaaaaaa ggaaaagaaa
tagaaccata aacaatcaga agggtgattt 47460ttagaaacag agtgccagaa aaaattgaga
aaatggaata aagcagtttt aaaaatgcaa 47520ggatggcctg taaacaagat tgtaatataa
gtaaaataag ttaaatcaaa gggtagagtt 47580tggctaacgg aaccactgag ggtagtaact
cattctgctt ggaaaatggg aaatgcaaga 47640acttgtgatt cactaagtta ttctcaataa
ctgaaaaacc tgataaactg aaaaaggaga 47700aaacaaagtg agacattatg agcatttatt
ttctcttcaa actcacctag gattatatga 47760atatcctaaa aattagaatt agaatatcaa
gatcctaaat attccagata cttaaattga 47820gatcacaagt acaaatgcct aaaagagaaa
aggtaactga aagattgaaa gagtttagaa 47880aaagaaaaaa gaaacaggtg ataaaactgg
gagagcatat gccctagcta aagggagagt 47940tccagtcaac tattgtcata tggaagtgac
aactcatcct gcattgccag atttcccact 48000tgtcaagaga aaaataagtt ttttgtattt
aagaaaataa attaatatgt ataaaatgcg 48060gacttagcta tttcaataat tagtactgaa
cgaaaaaatg tctaaaacac tgcagggcaa 48120aatgaaattc acctacatca tatcacagaa
taccagttta taacctctga tataaacaaa 48180cacttccagg ctgggcacgg tggctcacgc
ctgtaatccc agcattttgg gaggccgagg 48240cgggcagatc acctgaggtc aggagttcga
gacagcctgg ccaacatggc aaatctctgt 48300ctctactaaa aaatacaaaa attaattggg
catggtggca ggcacctgta gctactcagg 48360aggctaaggc agggagagtt gcttgaaccc
tggaggcgga ggttgcagtc agccgagatc 48420acacaactgc actgcagcct gggcaacaga
gtgagacttc gtctcaaaat aaataaataa 48480ataaataaac aaataaacac ttccaaattc
ctaaacctag gtaccaaaag gtttggagaa 48540atggaatgta cttcctaata tttgaaccaa
ggtctatttt ctccatggta ataattccat 48600aggaaaatga gaggacaatt gtttgatggc
tggggtagta gaagtatgta aaaggagaaa 48660taaaacattt atttaacacg taccaaaagt
acagtgatag aaattgttag cataccttta 48720taggctctaa caagaagccc taacaagttc
aaagatgact ataattcaca attattaaaa 48780ctaaattctg aaagtacatt acccttcaat
ttgtgccaat ttgttgtcca tgatataggc 48840caaagcacct gcaagctctt tctttaaatg
gccaacttga tttccattca cattgtttac 48900tttaattgca ttcttatcat aagggttatt
aggatctcgt tgtaatgcaa ccatttcatt 48960attattaacc taataaaaat gataaacaga
taatattaca ttataaataa tagacaataa 49020ctttgttata ctttatctgg gatttccatt
cattttttaa tgctactgct aaacactgca 49080cataagaaaa atatattttc tagacacaaa
ctacagaaat attaatatat attattatga 49140ctccttaata ttatttttct aatctatatt
cattaggaag ttaacaaatc cttaatgtct 49200taaaaaatag tcaaatgtaa actagcaaat
tcaaattatc taaaattttc aacatcataa 49260cttctgaaaa atagccaacc cttattgagc
gcttactatg tgccaagcat tggtctaagt 49320acattgcaca gactgacttt ttaaatcctc
acagctctat tgggaggtac tatcattatt 49380cccattttac aaatgaagaa actgaagcac
agaaaagtaa cttgcccaag gtcacaaagc 49440taaaaggtag cagagctagg atctgaaccc
acacagggtg gttcctgagc ctcaattctc 49500cagaatataa aagtcaggga ataaggcatt
tataagtcag gatggaacta ggtaagaaaa 49560atatatccag ccttcaactc atgccattct
gattctactt tattaaaaac tgtatgattt 49620aacttaaata ttatgatgta tatttttaaa
aactgaagga gggtaacaca ataaagaatg 49680gtagaatgag gactttcaaa aattcctcaa
taaaggcaac aaaaaaacta gataaattat 49740ttgtcagaat caaatgtttc agaacactga
gaactaaatc aaaggcttgc agaaatctgg 49800gtaacattta ttcaagaata aaaagaataa
acagctgaat atcagtactt cagtcccatc 49860cccagcaatc ctgtagcctt taaaaatagc
tcacggccag gcacagtagc tcacgccccc 49920agcactttgg gaggccgagg tgggaagatc
acgaggtcaa gagatcaaga ccatcctagc 49980caacatggtg agacccccgt ctctactaaa
aatacaaaaa ttagctgggt gtggtggcac 50040gcgcctgtag tcctagctac tcgggaggct
gaggcaggag aaccacttga acccggcagg 50100cagaggttgc agtgagccaa gatagtgccc
ctgcactcca gcctggtgac agagcgagac 50160ttcatctcaa aataaataaa taaataaata
aataaataaa taaataaata aataaaaata 50220atagcttgtt gggatcctag gtaaagcctg
gcagtcacca gaaaaaaaga gaatggagtt 50280acagttcttt cagagattca ttcccaaaga
actgttattc tcctgaagtt ccccggaaga 50340ccccacttgc aaggctgact gtatttaacc
tctgagctca ccaagtacaa aaaaacctcc 50400cttgggcgga tgtttgtcaa aacaatttta
caggaaagtg ttttaacttc atggctacct 50460gaggcagtgg ataacagctg aagcaaaaaa
acaaaaaggc ttataaagaa gagctaggga 50520atgagatgtc tgtgagggct ttgaaaagct
ccagtgtatt tctggatatc tagaaggccg 50580taagcaagca cagggctggt atgcatgacc
agggctgtgc acattctcaa gaaagacctg 50640agaaggccct aaacgctcac ctttgcctga
acttgagcat ttaaacaagc cagaagtgaa 50700agctaaagca gagttgtcag gggccttaga
gtgttgaagg aatgccctaa catacagaag 50760ttctcagcaa agaatgtacg atttattagt
tccagcacaa tcatcagctg accgctaagc 50820taaccaagta cagacttcag tgaccacaca
cgataaagga tagacatcac agaattaatt 50880caggaaagtc actaacaaac acacactaat
tacaaaactc agcaacaaac caccctagaa 50940gcctaacaaa cacacactaa ttacaaaact
cagcaacaac aaaccaccct gatttccaga 51000gctgccacat tatttaaaat gtcaattttt
caagaaaaaa agtacaagac atgcaaaaaa 51060aataagaaag tatggtctat acacagggaa
aaaaagcaat caatgtcaac tgtccccaag 51120aaagtataga tgtttgacat agtagaaaaa
gaatttaagt cagttatttt aaatatgttc 51180aaaggggctg ggcatggtgg ctcacacttg
taatcccagc actttaggag gctgaagtgt 51240gaggatctct tgaggccagg aatttgagac
cagcctggga aacatagcaa gaccccattt 51300ctacaaaata aaaatagaaa aattagccag
gtatggtggt acatgcctgt agtcccagct 51360attcaggagt ctgaggtggg aggactgctt
gaatgcagga gttcaaggtt acagtgagct 51420atgatcacgc cactgcactc cagcctgggt
aatagagcaa gaacctatct ctaaaaaatt 51480aaaaagttca aagacctggc cgggcacagt
ggctcacgcc tgtaatccca gcactttggg 51540aggccgaggt gggcggatca cgaggtcagg
agatcgagac catcctggct aacacagtga 51600aaccccgtct ctactaaaaa tacaaaaaat
tagccgggcg aggtggcggg tgcctgtagt 51660cccagttact cgggaagctg acacaggaga
atgatgtgaa cctgggaggc agagcttgca 51720gtgagccgag atcgtgccac tgcactccag
cctgggcgac aaagcaagac tccgtctcaa 51780aaaaaaaaaa aaataagttc aaagacctaa
agcaaatcat ttctttcgac atgaaaacgc 51840atgtctaaaa gcatggagac gatgtctcac
caaacagcaa ttctagagtc aaaaggtaca 51900ataactgaag aaaaaattca ccagagagac
tcaacagcaa atttgagcag gcataagaaa 51960aatcagcgaa gctgaagata ggtcaattga
gattatacaa tctaaggaac aaaaaaatga 52020cacacaacag agcctcagaa acttgtaata
tcaagcatac caacacacat acataagagt 52080accataagga gaaaagagaa aggggcagaa
aaaatattta aagaaataac agccaaaacc 52140ttcccaattc gatggaaacc actactctac
acatataaga agctcaacaa ctccaactaa 52200aataaactca aagagatcca cacctggtca
catcataatc aaactgtcaa aagaagtttg 52260agaccagcct gggcaaaagg gtaacgacag
tttctaaaaa aaaaaaaaaa aaaaaaaaaa 52320aaaaaaaaaa aaaattattt aaaaaaataa
aaatttttaa gtataatttt ttgtaatttt 52380tgtcctatct cattaaaaag tatataatta
taaatctttg ttgataggta tactacatat 52440ataaagatat aacttggata acagcacaaa
ggaaggcaat aggatggagc tgtgagcaaa 52500ggtttttttg gggttttctt tgttgttgtt
tttttgagac agagtcttgc tgtcaccagg 52560ctggagtaca gtcgcacaat ctcggctcac
tgcaacctct gactccctag ttcatgcgat 52620tctcctgcct cagcctccca agtagctggg
actacaggca ggtgccacca tgcccagcta 52680atttttgtat ttttagtaga gatggggttt
caccatgttg gccaggatgg tctcaatctc 52740ctaacctcgt gatccactca ccttggcctc
ccaaagcgct gggattacag gcgtgagcca 52800ccatgcccca tgcccagcca caaagttttt
tatatactgt taagttggca ctaatccaaa 52860ctaggttctt ataaattaag ttgttaattg
taatccctaa agcaatcata agtaactcaa 52920aatatatagt aaaagaaaca agggaattaa
aatagtacac taaaaaaaaa aatctattta 52980acataaaaat gagtattgga ggccagaccc
aaatgccaca tattgtatca tttcatttac 53040gtgaaatgtc aagaggaggt aaatccatat
acacaggaag aagactgatt catggttgcc 53100agggactgag gggtagagat aatgggagtg
actacaaatg ggaatgaggt ttctttttgg 53160agtaatgaaa atgttttgga actagattgt
agggatggtt gcacaacctt gtgagtattc 53220taaaaactac tcaactgtac actttaaaat
ggtaaatttt ataataaatg tatctcaatt 53280ttaaaaaatt ccttgaaaat atacaagaaa
gctgcttcaa acctctccca gcatgtctac 53340tgactcaaga gattacagcc agaactaaac
agagtattag acagatattc atctctagac 53400ttttagagat attatctatg aggcagagct
atgttctctc ccagtcgcca cccctctgcc 53460actgcctaca ggatgatctt ttttaaaccg
gattatgcaa atcctagaaa atacataacc 53520aaaataaaaa agcagggcac aaaactaaca
atttttgagc atttcatctc aaggtattac 53580ataattgtaa tttatcataa taagctatca
ttaattacaa ttagttaact gtcgttacaa 53640ataagtaatt actagaattt taaataaaat
atatatgagc caaaatattg gagactataa 53700aaacaaaaac atggtcccag cagcaagaaa
aaacaggaca tataaaaaaa caaaaagaca 53760aagacaagaa taacaaaagg aatgagaatt
aatcataaca gtatagtcat accccgcata 53820acaacatttc agcaatgaca gaccacatgc
attatgcaac agttgttcca caagattata 53880atggggctgc cctatagaag tgtaccattt
ttcatatttt atacaatatt tttactgtac 53940ttttttcatg ttatatatgc ttaaatacac
aaatgtgtta caattgccca cagtattcag 54000taaagtaaca tcttgttcac tctgtttcca
caaagaaatc acctaacact tttctcagaa 54060agtatcttca ttgttaagca acacataact
gtattttaaa atagaaaaag aaataaacca 54120cattataaaa taaagccttt gatgggtcaa
aaaataaact taaccatatg cttattgaaa 54180acacactaaa aaaaaaaaaa caaaaaataa
aaatgaaata agaataagaa aacaccagag 54240ttaaaaataa agaatgctac tacataacaa
aacgtggtgg ctaaaggcaa cgcaatactt 54300aacacgttta aagattaaaa gattaaatga
ctcaaattaa gtgctcaaat cgcaagtcca 54360acagaaatga aaagaaaata aatctaagta
gaaaaaccga atgtaagagt aattattaag 54420ccaagaaaca caatcattaa gaaaaattca
agagagcctg ccttgaaaag aaataatgtc 54480atagctaatc aataaaaaag acacgaatgt
gtaacacaaa gaataaaaac aggaatataa 54540tcaccttttt aaaaaagttt ttaaaaagtt
taatggagcc aataaggcat ataaaattat 54600gttctggcag acattaaaaa aatggcacat
tttatcaaaa tttgacaatt cagagtaaag 54660ggtactgaaa tttaaggttc agagcaccaa
cttaaaacaa ccaaaaaaat ggttattcaa 54720cttggaaaga gtaactagaa ccaaggcata
aaagggagtt taagtatgtg ttcctccaat 54780tcaaaatcct tcaggaatag gagttctggc
cacttgtgag caacacatca taccaccaaa 54840ccccaattcc agaaaattac agttacaatt
ttataaggat aaatgattac tatgcacaat 54900tctatgggaa taagtttgaa tatctaaata
aaatggatgc ttttcttcaa aaatgtaact 54960aaccaaaata gaccaaagaa gtacctaaat
tatgctgagt gaaaaacgcc agacttaaaa 55020tagtacatac tgtagaattc aatgtatatg
aagttctaga ataggcaaaa cttatttata 55080gtgatagaag acttgtgtgg ccgggcgcag
tggctcacac ctgtaatccc agcactttgg 55140gaggccgagg cgggcggatc acgaggtcag
gagatacaga ccatcctggc taacacggtg 55200aaaccccgtc tctaccaaaa atacaaaaaa
attagccggg catggtggca ggcgcctgta 55260gtcccagcta cttgggaggc tgaggcagga
gaatggcatg aacttgggag gcggagcttg 55320caatgagccg agatcccgcc actgcgctcc
agcctgagaa acagagcgag actccgtctc 55380aaaaaaaaca aaaaaaaaga aaagaaatga
gactcgtggc tgcctagggc aggaacaggg 55440aaaaatgaat gcaataggat acaagatact
ttgagggaga cggatacaat ctgtatcttg 55500attggggtag tagttatcag gtatacatct
ttgtcaaaac actaggatca tacatttaaa 55560atatgtacat tttatcatac gtaaaccaca
tctcaataaa gtagaaaaaa aattaaaata 55620acaagtatgc aaagatagta tctcaatttt
tattattttt ataatagtga ttaactcaaa 55680aatgtccatc aacaaggaaa ttataagtcc
aatatagatg atattcctaa tcacattagg 55740cataatatga tctcattttt attaatacat
aatatgattc caatttacat gcacatgctc 55800acagaaacaa gtgtagaaga taagttatgg
ttctggatga tagacttcag aagatttaat 55860gcctttttag ccatttaaaa ataatgaacg
tggctgggcg cggtggctca tgcctgtaat 55920cccagcactt tggaaggccg aggtgagtgg
atcacctgag gtcaggggtt caagaccagc 55980ctgaccaaca tggcaaaacc tcatctctcc
taaatacaaa aaattagctg ggcatgatgg 56040cgcatgcctg taatcccagc tacttaggag
gctgttgcag gagaaccgct tgaacctggg 56100aggcggaggt tgcagtgagc caagattgca
ccattgcact ccagcctggg caatgcaagc 56160aagactccgt ttcaaaaaaa aaacaaaaac
aaataaaaga acatgcatta attataaaat 56220aaatgatatc tcaacatagt gttcctggaa
aaaaataaaa tttaaagtat atatatttgg 56280caaaataaga aaactgatga ttacttttta
tcaacacaca ctagagttta ccgaagtcat 56340agagattatt ctcagatcaa aaatactgat
tgttctaaga tcaataacta ccataatcta 56400gtatttcatg gtggtataaa ataattatcg
aaagcaatgt aagttttgaa aaattagcac 56460aaatctgaaa cagaaaagct cagaaatgca
tatgagagcc agagaatata tagatgtgaa 56520ataagaactg atagagttca agaatctatt
agaaaaaatt agaggccggg tgcagcatcc 56580cagcacttgg agaggccaag gcggacggat
cacttgagcc caggagttcc agatcagcct 56640ggacaacatg ccaaaaccta aaaaaaaata
caaaaaaaaa aaatacaaaa attagctggg 56700tatggtggtg aacgccagca gtcccagcta
ctcaggaggc tgaattggga gaattgcttg 56760agtctgggag gcagagattg cagtgaggcc
agatcaggcc actgcactcc agcctgggca 56820acaaaacgaa acttggtctc aaaaaaaaag
aaaaaaaaaa aaatccaatt cacttaagtg 56880aaagaaaatc acactggcct gagaattcta
tgcaacatta ggatatggag attttattac 56940aaagatttta ttacaaagat atattacaaa
gattttaaat tcagcccaac tgtcattaag 57000tccttcataa ctttcatttt atgaaactgt
cattcataaa atgccacaag tatgaaaata 57060taaaaactta ggaaatattg tctcttccta
aggaatctcc tagacaacac atttcagaca 57120atcaaaaaac aatttgaaac gttccaatgt
aaaggaaata caggagaaaa acaaatccag 57180aacatgaaac tcctaaggaa tctcctgtcg
aacatgcttc agacacccta ttgagaagct 57240taaacataag aaacatacgg agaaatgaaa
gaataaaatc cagaagattc cacaaaacga 57300tctccttttt gcaacaaatc gatgacatga
acaaggcgct ggggtgtggg ggtggggtgc 57360tacccataat aaccaaacat aacgcactga
ggccaagcat attggctcat tcctgtaatc 57420ccaacactct gggaggctga ggtgggaggg
tcacttaagg tatttgttat tttttgcgta 57480caaatattta actaacttaa agtaaaataa
agattaaccg cacacatcac ctgcgaattc 57540tcttattttc taggttaacg ccagaaatat
aatcaaaatt taattatcta ctattatact 57600cactactccc gtgtaatagc gtagtccaac
cacatgacct ctcaaacttc caaataaaac 57660ggaatctact tcttcatcac tagttagaaa
gtcatctgga gggataacat cttggaattc 57720aaaacgtgga aagaaagttg gatatgagag
gcgtggaaaa tttccatgaa ctccatactg 57780gacagtctgc aagtacttcc aaactggatc
cctatttttt tttaaaggca aagaaaaaca 57840atataatatt taagtatttc caaagaccat
atgagtagtt ttctcatgct ttacttcagc 57900aaatgaaatc ttacatggga atcacggtaa
agattaatag ttactctcgt taaagcagga 57960attgcagtga gtggagaagg cttctaggaa
ctctacctgt tttcaatatt ggctcttcta 58020cctgccccct aggtattcaa aatgaaaagc
ctagtcagag ttcactaacc tctttcccag 58080tgaaaatcat ctaaattgat atgtatatga
tgtctgtaac tacaattatg ggccggatgc 58140ggtggctcac gcctggaatc ccagcacttt
gggaggcaga ggcaggtgga tcacctgagg 58200tcaggaattt gagaccaacc tggccaacat
tgtgaaaccc tgtctactac aaatacaaaa 58260attagccagg cgtggtggcg cccatatgta
atcccagcta ctcgggaggc tgaggcagga 58320gaatcacttg aacccgggag gcgtaggttg
cagcgagccg agattgtgcc actgcactcc 58380agcctggacg acagagcgag actccatgtc
aaaaaaataa tttaaaaaat gaaatcatat 58440gccaaaattt caatagggat tgagctatat
aaaagactta taaaaataca actattaatg 58500taccagaagt tgtttataat tttccagaaa
atacatccag catccatcac tgtaactccc 58560tgcgagcaga acacctattc tggtatccaa
accacgccac cgtgggaaat tggccccaag 58620ccacccgtac ccgccttccg tcgccggttt
aagcataacc ccaaacccat taggtgtaac 58680tgtttcggga atgaccataa ataccaattt
ggacaatgaa aagtgaaggg gagtctactg 58740gagcattgca aaaagtttcc attctcctaa
gaacagtcgc ttatttgcga tgcgttttcc 58800ttgcaagtgg cgcatatctc tactagcgcc
tctgcaaata cttatctgca cgtctgtgca 58860tacagataag ctgtgagcca cttggggaca
aggactgtct tactcggccc tccaccgagc 58920acaatgcctg gcgcacagtg ggtatctaat
ggatgttttg ttaaataaag cactggccga 58980gatgctttaa gccccaggcc ccccacagtc
ggtgacagag atttcccaag tccctaacac 59040gggactcgcc ctaggagccc ctactcgcca
gcgaagacaa tgcatttatt tctccggcgg 59100ccacatatgc gaccaacaga acgaatacag
ctgcacaaat cgcccaggga acgcagagga 59160acgcggggaa ggtcaggttc atttggggac
gcctccaggc cgttagaccg agcgccccac 59220cccctccgcc cccttcacct cttgaacatc
caggacatgg cgctgagtgg gatgacaaga 59280ggagcgcctc ggctcccctg gatcgttttc
gagccgcctc gatacgcctc cttccaggcc 59340ccgcagccct gaagccgggg acaaattccg
agcgccggat caggagcgca cgactgaaag 59400gtaagtcgcc gcgagtccag tcagacgtcg
acgccgtctc cttctgcaac aatctgggag 59460accagcgtcg ctctgtgact ggcactagga
aagcccaatc acgaagagga gagtgcggag 59520ccaaaccagt cagagcacag aagggagggc
aactccgccc cgctgccatt caaagacggc 59580ggggggtccg ggctgcaagg gtggttccat
ccgggttctt ccccgccccc aaggcgggcg 59640cgcgggaaag ccacgaggcc ccaggagtgc
gactgcggtg cctgcggtgc cggtgttttg 59700tttgattccc tgcctcaaac ggagggaaac
gaccttcctt tatcctacga gtcctaagac 59760tgaaccccat tctaaaggct ctaccgtatc
cttcctctta ttttcttctc atctaatgtg 59820gcacataata gggccttata ctaaaggagc
tccacggttt acgagaccta gagctaccgg 59880cgagatactt tacttcattc cctgtggtga
accgtgggga ctttcgctcc aaattttcat 59940gttaagcctc agcgtatgca tgagacacaa
cgagtttgga aaatcttaaa tggaacttag 60000agtcccctcc ccacctcttt ttgttatttt
taaggaaaat tttcctttct tggtgcagga 60060aacccatcac atgtttatta cagctatggg
ggcgtttgcc tgaaatggtg gacgggacca 60120ttttccccgt gggcacttgg ctgctccagc
caagaggggg aggcccttgt tttcctcaag 60180gaactgcaga ggggcgctct gaggccctcc
atggctctct ttccagagtc tgaggtgacc 60240ggaaggagaa tgcggccctg ggaccgtcaa
ccttggacca gctgcagccg acgcctggca 60300gggctggtcg ctttgcgttg aggaggctgc
tgtccccgaa gctggccttt taatcgcaca 60360gggcaggaag ctggtggtgg cgcccagctg
cacaggcggc accatgtaac tgccagataa 60420tacttgcgcg tcacagagag gtccatgtta
cacgcctgtc agcacaataa tattaggtgg 60480tcagcttttc ttttttcttt tttttgtttt
tttttctttt ttattgagat ggagtcttgc 60540tctgtccccc aggctggagt gcaatggggg
tgatcttggc tcactgcaac ctccgcctcc 60600cggattcaag cgattctcct gcctcagcct
ccggagtagc cgggattgca ggagcccgcc 60660accacgccca gctaattttt gtatttttag
tagagacggg gtttcaccat attggccagg 60720ctggtcttga actcctgacc tcaggtgatc
cgcccgcctc ggcctcccag agtgctggga 60780ttacaggcat gagccaccgt gccaggccgg
tagtcagctt ttcaagacac atttgttcat 60840tatcgtaaat aaactgtagt gatctctaat
catgaaccat ggatgagcaa tagaatttga 60900aacaatgtat tatttcattt gaccaaagtt
gatgaggaag ataaagacaa tggcatttca 60960aattatttta attgttgtat gttcttcttg
aagtgtgttg aggcaaatgg caatacagtt 61020cagcttttag tatgccagat tttaaataaa
ttcttggaaa atatgccaga aattgctcaa 61080attgatgttt tgtaagagta agaaagtcat
gctcactaga caaagaaaaa attccaaata 61140tgagaacata gctctttcag acttaagact
ggccaggcgc aatggctcac acctataatc 61200ccagcacttt gggaggccga gttgggtgga
tcacctgagg tcaggagttc gagaccagcc 61260tggccaacat ggtgaaagcc tgactctact
aagaaaatac aaaaattaag cagacgtggt 61320ggcacgcacc tgtagtccta gctacttggg
aggctgaggc aggagaatcg cttgaaccag 61380ggagacatgg gttgcagtga gccgagatca
agccactgca ctccagcctg agcaacagag 61440cgagactcca tctcaaaaaa ataaaaagac
ttaagatcta tgaataatga ctgtcccatg 61500gttaaagaat gtgctttgaa tgaactaaaa
tttgctattt aaaggaatgg tatggaaagg 61560aaaggaatcc aaaatttctc catccagcca
cttcccagtc acaaacacac ttctcatctg 61620cacccccagc cacacacaca cacatgcccg
cgcgcgcgca cacacacaca cacacacaca 61680cacacacaga acctttatgc aaattaatca
tgtcatgtca ctcccctgtt tgattcagtg 61740agcctgaaat ccaagaatgg catatgtggc
tcttcctccc acagtatgtt ttctctatgt 61800tatcaatatt tcacatccca gaaccaggag
taaaacattc tttcccttaa tcattctttg 61860ttttatattt aaagatcaag tacaatttgt
actagtttga ttaaaatgtt acagcaatta 61920caatttcaaa actattatac taaataatgt
tttctgaaaa attaactttt ttggtttttt 61980cttgatttat tctgataaca gcatcacaag
tagatatgaa aaatgaacac ttgtaactgg 62040aaaatgaact gtagggtggc ttgtggggtt
tggctggtga gtaagaagga aagtggcact 62100aaaaggacgg tggggaagat aagggccagg
ttacatagga acttaagagt ctccagtaaa 62160atttgtgttt taactgcaat ggaaagccat
tgaatgtttc gagcaggagg ataacgactt 62220gatttaggct tttaaaaatg ctggcagctc
tgtggagaat tacaggaaac aaggatagaa 62280gcaactgata gaaaattatt gtgttcagat
aagagatggt ggtggcttgg aaagggaagg 62340tgatgaagcc aagagaacca aaatgttcac
tgataaattt aggtaggaat ggtatggaaa 62400ggaaaggaat ccaaaatttc tccagccagc
cacttcccag tcacaaacac acttctcatc 62460tgcaccccta gccacacaca cacatgcccg
tgcacacaca cacacacaca cacacacaga 625204027DNAArtificial Sequenceprimer
40tacgccactt acaaaaaaaa cgcatcg
274131DNAArtificial Sequenceprimer 41tttaagttat tcgtattcgt ttttcgtcgt c
314231DNAArtificial Sequenceprimer
42aatatacacc acttacaaaa aaaacacatc a
314334DNAArtificial Sequenceprimer 43ggttttaagt tatttgtatt tgttttttgt
tgtt 344422DNAArtificial Sequenceprimer
44gctcctcttg tcatcccact ca
224525DNAArtificial Sequenceprimer 45cgtctttgct tagtccatct gcctt
254621DNAArtificial Sequenceprimer
46cgatggtcta tgaaacttgg a
214727DNAArtificial Sequenceprimer 47gaaattgtgt cagtaatacc tcttcac
274824DNAArtificial Sequenceprimer
48tggggtttcg tggttttttc gcgc
244925DNAArtificial Sequenceprimer 49ccgcgaatcc aatcaaacgt cgacg
255029DNAArtificial Sequenceprimer
50atttttgggg ttttgtggtt tttttgtgt
295129DNAArtificial Sequenceprimer 51atcaccacaa atccaatcaa acatcaaca
29
User Contributions:
comments("1"); ?> comment_form("1"); ?>Inventors list |
Agents list |
Assignees list |
List by place |
Classification tree browser |
Top 100 Inventors |
Top 100 Agents |
Top 100 Assignees |
Usenet FAQ Index |
Documents |
Other FAQs |
User Contributions:
Comment about this patent or add new information about this topic: