|
|
|
|
LOCUS NC_003071 6000 bp DNA linear PLN 20-AUG-2002
DEFINITION Arabidopsis thaliana chromosome 2, complete sequence.
ACCESSION NC_003071 REGION: 16970001..16976000
VERSION NC_003071.2 GI:22326553
KEYWORDS HTG.
SOURCE Arabidopsis thaliana (thale cress)
ORGANISM Arabidopsis thaliana
Eukaryota; Viridiplantae; Streptophyta; Embryophyta; Tracheophyta;
Spermatophyta; Magnoliophyta; eudicotyledons; core eudicots;
Rosidae; eurosids II; Brassicales; Brassicaceae; Arabidopsis.
REFERENCE 1 (bases 1 to 6000)
AUTHORS Town,C.D., Haas,B.J., Maiti,R., Hannick,L.I., Chan,A.P.,
Ronning,C.M., Smith Jr.,R.K., Yu,C., Wortman,J.R., White,O. and
Fraser,C.M.
TITLE Arabidopsis thaliana chromosome 2 CHR2v07142002 genomic sequence
JOURNAL Unpublished
REFERENCE 2 (bases 1 to 6000)
AUTHORS Town,C.D. and White,O.
TITLE Direct Submission
JOURNAL Submitted (29-JUL-2002) The Institute for Genomic Research, 9712
Medical Center Dr, Rockville, MD 20850, USA, cdtown@tigr.org
COMMENT PROVISIONAL REFSEQ: This record has not yet been subject to final
NCBI review. The reference sequence was derived from AE002093.
On Aug 20, 2002 this sequence version replaced gi:15224037.
The Arabidopsis chromosome sequences were assembled from BAC
sequence data using the current BAC tiling path. The chromosomes
include unfinished BACs from the HTGS division of GenBank. Gaps
within unfinished BACs are represented by NNNNs. The latest
BAC-based annotation from the TIGR database
http://www.tigr.org/tdb/e2k1/ath1/ath1.shtml was propagated to the
chromosomes and discrepancies between gene models in the regions of
BAC overlaps were resolved. This chromosome annotation is
available in TIGR s XML format at
ftp://ftp.tigr.org/pub/data/a_thaliana/ath1/PSEUDOCHROMOSOMES.
FEATURES Location/Qualifiers
source 1..6000
/organism="Arabidopsis thaliana"
/cultivar="Columbia"
/db_xref="taxon:3702"
/chromosome="2"
/clone="CHR2v07142002"
misc_feature <1..>6000
/note="Chromosome Sequence Derivation: nucleotide sequence
in this region was derived from BAC clone T7D17."
gene 340..1213
/gene="At2g40780"
/note="T7D17.4"
mRNA join(<340..453,616..691,802..928,1015..>1213)
/gene="At2g40780"
/note="protein id: At2g40780.1"
/transcript_id="NM_129641.1"
/db_xref="GI:18405502"
CDS join(340..453,616..691,802..928,1015..1213)
/gene="At2g40780"
/codon_start=1
/protein_id="NP_181610.1"
/db_xref="GI:15226743"
gene complement(1574..2309)
/gene="At2g40790"
/note="T7D17.3; contains a thioredoxin family active site
(PDOC00172)"
mRNA complement(join(<1574..1768,1823..1984,2043..>2309))
/gene="At2g40790"
/note="protein id: At2g40790.1"
/transcript_id="NM_129642.1"
/db_xref="GI:18405505"
CDS complement(join(1574..1768,1823..1984,2043..2309))
/gene="At2g40790"
/codon_start=1
/protein_id="NP_181611.1"
/db_xref="GI:15226744"
gene complement(2859..5438)
/gene="At2g40800"
/note="T7D17.2; supported by cDNA: gi_17473708"
mRNA complement(join(<2859..3384,3929..4201,4641..>5438))
/gene="At2g40800"
/note="protein id: At2g40800.1, supported by cDNA:
gi_17473708, supported by cDNA: gi_20148506"
/transcript_id="NM_129643.2"
/db_xref="GI:22326301"
CDS complement(join(3118..3384,3929..4201,4641..5234))
/gene="At2g40800"
/codon_start=1
/protein_id="NP_181612.1"
/db_xref="GI:15226745"
BASE COUNT 1963 a 1164 c 1139 g 1734 t
ORIGIN
1 attcgttcaa aattcagaaa ttgcgaatcg acggagacta tggaggagtg gaatttcaag
61 cgccggagaa gattaataga ccgccgcgtg atataataac gtattgccgg tttatctgcc
121 aattcgtttg caccgacgtg gcttgagaat gacaatagta ccctcagctt ttaaataaaa
181 taacgaaact acctcctctt ctctattgga actattccga ttccagtaaa acggcacaca
241 actgcaaaac cctaatctca agttttctgt cgattttgat cttttggttg taattttgtt
301 tgtgaaagtt tcggactttt ggaatttgag gtagaagaga tgaacagagg aagaaggaat
361 ctgaaacaag cggcgtcgga ccaggatttc acgcttgagg aatgtcagag cattgcccaa
421 gtcgtctctc tcagaggttc caatcaaatt gaggtaaaag ataaacgctt ttttcttagg
481 tttcatacaa tctcgccaat tcatagcatt ttcttcggaa ttttctttgg atgggttggc
541 ttgtttgtat gtatgtatgt actgaattaa agtttgtgtt ttaactgctg tagttgtttt
601 gttgtctctt tgaagataat ggatgcaaaa ggagagaact cattagcttt gtttccagcc
661 aagtttcgtg agagcatgtg gatcagacga ggtacttttt cttgttagtc ttctctggtt
721 tgttctttct tgaatggtta gatttactgc atgatttggt gtggattgga ggagtttgag
781 atgtgctaca ctgatttgca ggaagctttg tagtgattga ccatacagga aaggaaaagg
841 ctcaagagtc tggtagcaaa gttacatcta ttgtatgtaa agttctattc tttgagcaag
901 tccgtcttct tcaaaagtct ccggaatggt atagcttcta tcttttgttc tgtatggatg
961 gcttctcata ttgagtaatt atagctcatg ttagtttgga catggtgaat gcaggccaga
1021 aatcttcaaa gatactagac cgattccagc tgagaaaagc tcacccattg aacagcatga
1081 agatgacggt gaagttgatt cgagtgatga tgatgatggt atgcctccat tgcaagcaaa
1141 cacaaacaga ttgagaccgt ttggggtgaa gtgtgatgca gaaactgatt cagggtcaga
1201 ttccgattca tagaaacatc cggtacattt cttttcgcag cctcaactta atttctcaat
1261 ataggggatt tatagttgca agctgtattt tataaacagt atgtaaccaa cacagatctg
1321 ctgataaagc agagttttgc tttgcactaa atcgataatt gatttatata acatgcttct
1381 cattctcttc ttgcaattct tgataagtct taataaggag tcagaaaagt taatattggt
1441 tgattgagat taacactaca agagtgcatc aaagttaata acaaaaatct cagtgatacc
1501 aacatgatct gcatataaat actggtcaac tttttatttg gataattttc tgacatggaa
1561 actagaaaaa acgttaagat tgtctaagaa ggagatttgc agctgctgca gttttcttct
1621 gaagctctgc agcatctcca cctacaagtt tatccatttg cctcccatct ttaaggaaca
1681 ctactgttgg agttgcatcc acgttccatt catgactaaa ctcctgtatc aaacatattt
1741 taacacccaa atcgctcagt ttcgagatct ctcatcttca caaagaatca ttctaagtaa
1801 cagaagttga ttggttactt acagcgagct cttcgacgtc tatagtgaca aatatcattg
1861 aggtgtatgt ggatgcaagc tcctggtata ttggtaatat tgttttacta ggtaaacacc
1921 acgaagcctt gaaattcact acaagctgtt ccatgttaca aagaatcaga tcatgtcgag
1981 ctatctacag aggaagaatg caaaacaaca tagaaaggca aatcatagga ggaagagctt
2041 acaattttgc catgactatt agcttctgtg atcttctcct cccacttctc cattctactt
2101 accggatgga ccttcccttt tataaagtag gaccctttct ggctcctagc ttgagtcttg
2161 ttccttctgt tgcaacagca gatacatgaa cacacctgtt tcagaacatg agattttcta
2221 tcaccatttt catctgaaac tgcacaatgc ttatgaacat tcaacgggtt taataaaaat
2281 ttgctagtta gattcgacat gttctccatg aaactcgtaa attttaaggt aaaggtctta
2341 cctttttaca acaaggaatt cttgtacaat gattccccat taactttcta tactagcttg
2401 gatcattagc tctctgaatc aacagcacaa acaaaaaacc atttttctcg atattaacaa
2461 ttctaaacag ataaggagaa atttacagat agaattataa ggatctcaaa taaaggacct
2521 agctatttat ggtttgataa aagaagaaga ttaagaacgg attacttcta gatcttgcca
2581 acttaagatt gttcatgttc ttgcatgaat aagagataaa tccgtgatca aagatcaaaa
2641 gtttctgtga aaagaatcaa aaacatcaac aagaaggatc cagagaacgt tcaatttaag
2701 caacaaaaca ttatatttgt cttgggttac atcatacatg cgatgggaaa ttggttcgat
2761 taaccataac taattttgta gaccagaaac tggttgaaga cgacatagaa aatgacgctg
2821 tgtctatttt gggcgggcta tggtgggcct ttattaagct tacaccagtt tatactttgg
2881 aagacaaaag ttctaaacca aacggttttt cacagcacag gctgctctaa caaggagata
2941 catcttgtat gaagaagaat cagtaatcac aatagcatcc tctgatcaac atagccaggg
3001 aaatttattt ctcgaaagtt agcaaaaaca aaaaaaaaac tgggttttgg aaaagattca
3061 tggtgttgtt cttatatcca ccgaagaaga aaatatttct ggggaagaaa atacgagtta
3121 tgagccctcc ttctcgagtt tctcgatctc ttcacgatgc tttctttcag cttcttgaag
3181 ttctctttca gcgtcttctt cctcttcgat tctatcaaga ttatcgaact ccttagttgc
3241 tgccattgct ttcacaacag ggtctcttaa cacagagatt aaaccaccac caattctgta
3301 ctcttcttca tcgccaatca gaaataaccg ttggtcggga ccagatgcca ttggaatatc
3361 taccgccaaa agcttcatgt catactgaag aaacagacac atgaacataa gaagtagacc
3421 aaactcacaa tatagggagc gtgattggta atagggattc gctgatggta tgaaaaaatg
3481 aaagcaccta aaagactaga caggcatacg caatgtcatg taatatcagt ttccacatgt
3541 tatatataca agcctaaacg tttaactgaa actgaaagaa cagctatgaa actccaattc
3601 ttcttccaaa ttcaccagca tagagagatt gatgaatgaa aagattatct ctccgcttaa
3661 ccaaatgccc aaccaacctt acgaaatcta caaccaaaaa atctgccgag acatccttat
3721 aaactcagcc cttcgcaaac tgcactctag ccaactcaca acagctctaa tctacaacat
3781 tcagtcctag gctagataca gtagacctat agctaagagt tgagctgctg aactttgtca
3841 aagctaatta caactctgaa gattacaaca tgaacaagac agaagaaagt tgagtaaacg
3901 aggaactgag gagttgagaa agtgaaacct ggcctttctt cttcttaact tcaacgctaa
3961 caaggccctt ttgctcagaa ccctgaacgg ggaatagaag aaagcagcgc ttgctcctaa
4021 ttgttggctt gaatttctta aatgttatgc caccacctga catcacatac gctcgtaaat
4081 ctgatcctga cagaggagca cccataactt ctagaatttc agctgccgtg ttgatcttcc
4141 tcatggtcat tctataaact ttatccggat taattgtaaa ccttgagcgg aggtataatc
4201 cctattatta gaaaaatgca gatagctgtc aaggagtgat tcctaaatga tacgggagta
4261 ggagactgta acagcaacaa accatatcat atagcagttt ttgcattcca taaacaatat
4321 catgctataa gacacttgaa atatacagtg caagatctcc tgtatttctg tgaaggagat
4381 cagttaacaa ggagaatgaa cctatactct agtcgtggta aaatcaaacc aatatcattg
4441 taagctccac atccattcac tatcaaccaa taaccctaca gagaaattcc tcagtttcat
4501 ttttattcag gacaacttca tggcactgag ttgggcaaac attgcgagat catcactgaa
4561 acattactaa gaaatggagc caaccaccaa gaattgaagc acaaattccc atcgaaaagc
4621 acataaagag cgaaacttac agaaaatgct acaatggctg aggagagtgc aagaaaccca
4681 tacttggcca tgccctctga gaggccaaca aacgtactag caattccaaa cataatccgc
4741 caaaggaaaa tacatacaaa aacacctgca gctccaaaca caacaagact attctttttc
4801 caaaatgcat caatgtgcaa ccctatagcc tcacggtatc tagcaaaagt agaactaaca
4861 gctttgacag gcttatcaac aacctttctc gcaaaactcc catcaacctt tcgaaatcca
4921 gagctcttag tggataccaa tctaaacgcc gatgcaaaac tcacattcac acgaggcaaa
4981 cccaaagctt gctgcaactt aggattgagc ttaggagacg acagaatctg atataatccc
5041 acgtttttgg aagtgggttt cgaggataag ctgtggaagc taaacttagg ctgaagctga
5101 gaaacaccat tggagggaat agctggagaa gacaaagctg atgggtttga tcggccaatt
5161 gtgacagggt tgacccgagt gtaatgcagc cgaataaacc cttgaattgc tttgaaatcc
5221 gatggtttca ccatattcta tagcctttta gatcgatcac acaaaagctc gaaccgaaga
5281 tcctaacttg gagtatcgcg cctgattcaa aattaccttc gacttgagaa ctgatcaaag
5341 gaatcaatgg aaatgcgagt tttcaaattg ggatcgatga gttaaagagt cgtaatgaat
5401 cccattttcg ccgaagcttg ctgttcgtag cagcgattgg ggatcaatgg tgatcgatcg
5461 aattgaagaa atatcgggtc gggtcgggtc aggttgattc gtccgggtcg ggtttaattg
5521 gataccgaat tattgatcgg gttttacgtc ttactttctt gaaggaagga acaaagcaca
5581 acacaaagga cccacaagtg tctcagttca aaactccatg cccactcact actttggtcg
5641 gttcgcagat gacaggtagg caggggtaag attgtcatat gttataattc tgtctctaaa
5701 cgttttttaa aatttactta attagcaatt actatatatg taatttaata tacatttttt
5761 aaaaaaaatc tgataataaa gattgaattg attaaaatag tttaaaagaa gatttcatgt
5821 agagatgcat agaatcaaag gtagcagcct aaagtttgtt ttctaccaat ttaataaaat
5881 tggaacaata tagaatataa gcatggtcca tgctcaaaaa taacaagcat aaatagagaa
5941 atagaccata agtatgaaac catgcataga ccaaatctta attatattaa tatgaaaaca
//
Oct 21 2002 11:56:56