Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, Devon K, Dewar K, Doyle M, FitzHugh W, Funke R, Gage D, Harris K, Heaford A, Howland J, Kann L, Lehoczky J, LeVine R, McEwan P, McKernan K, Meldrim J, Mesirov JP, Miranda C, Morris W, Naylor J, Raymond C, Rosetti M, Santos R, Sheridan A, Sougnez C, et al: Initial sequencing and analysis of the human genome. Nature. 2001, 409: 860-921. 10.1038/35057062.
Venter JC, Adams MD, Myers EW, Li PW, Mural RJ, Sutton GG, Smith HO, Yandell M, Evans CA, Holt RA, Gocayne JD, Amanatides P, Ballew RM, Huson DH, Wortman JR, Zhang Q, Kodira CD, Zheng XH, Chen L, Skupski M, Subramanian G, Thomas PD, Zhang J, Gabor Miklos GL, Nelson C, Broder S, Clark AG, Nadeau J, McKusick VA, Zinder N, et al: The sequence of the human genome. Science. 2001, 291: 1304-1351. 10.1126/science.1058040.
Schmutz J, Wheeler J, Grimwood J, Dickson M, Yang J, Caoile C, Bajorek E, Black S, Chan YM, Denys M, Escobar J, Flowers D, Fotopulos D, Garcia C, Gomez M, Gonzales E, Haydu L, Lopez F, Ramirez L, Retterer J, Rodriguez A, Rogers S, Salazar A, Tsai M, Myers RM: Quality assessment of the human genome sequence. Nature. 2004, 429: 365-368. 10.1038/nature02390.
She X, Jiang Z, Clark RA, Liu G, Cheng Z, Tuzun E, Church DM, Sutton G, Halpern AL, Eichler EE: Shotgun sequence assembly and recent segmental duplications within the human genome. Nature. 2004, 431: 927-930. 10.1038/nature03062.
Cheung J, Estivill X, Khaja R, MacDonald JR, Lau K, Tsui LC, Scherer SW: Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence. Genome Biol. 2003, 4: R25. 10.1186/gb-2003-4-4-r25.
Stein LD: Human genome: end of the beginning. Nature. 2004, 431: 915-916. 10.1038/431915a.
Salzberg SL, Yorke JA: Beware of mis-assembled genomes. Bioinformatics. 2005, 21: 4320-4321. 10.1093/bioinformatics/bti769.
Lander ES, Waterman MS: Genomic mapping by fingerprinting random clones: a mathematical analysis. Genomics. 1988, 2: 231-239. 10.1016/0888-7543(88)90007-9.
Sutherland GR, Richards RI: Simple tandem DNA repeats and human genetic disease. Proc Natl Acad Sci USA. 1995, 92: 3636-3641. 10.1073/pnas.92.9.3636.
Read TD, Salzberg SL, Pop M, Shumway M, Umayam L, Jiang L, Holtzapple E, Busch JD, Smith KL, Schupp JM, Solomon D, Keim P, Fraser CM: Comparative genome sequencing for discovery of novel polymorphisms in Bacillus anthracis. Science. 2002, 296: 2028-2033. 10.1126/science.1071837.
Myers EW: Toward simplifying and accurately formulating fragment assembly. J Comput Biol. 1995, 2: 275-290.
Myers EW, Sutton GG, Delcher AL, Dew IM, Fasulo DP, Flanigan MJ, Kravitz SA, Mobarry CM, Reinert KH, Remington KA, Anson EL, Bolanos RA, Chou HH, Jordan CM, Halpern AL, Lonardi S, Beasley EM, Brandon RC, Chen L, Dunn PJ, Lai Z, Liang Y, Nusskern DR, Zhan M, Zhang Q, Zheng X, Rubin GM, Adams MD, Venter JC: A whole-genome assembly of Drosophila. Science. 2000, 287: 2196-2204. 10.1126/science.287.5461.2196.
Gordon D, Abajian C, Green P: Consed: a graphical tool for sequence finishing. Genome Res. 1998, 8: 195-202.
Staden R, Beal KF, Bonfield JK: The Staden package, 1998. Methods Mol Biol. 2000, 132: 115-130.
Semple CA, Morris SW, Porteous DJ, Evans KL: Computational comparison of human genomic sequence assemblies for a region of chromosome 4. Genome Res. 2002, 12: 424-429. 10.1101/gr.207902. Article published online before print in February 2002.
Li S, Liao J, Cutler G, Hoey T, Hogenesch JB, Cooke MP, Schultz PG, Ling XB: Comparative analysis of human genome assemblies reveals genome-level differences. Genomics. 2002, 80: 138-139. 10.1006/geno.2002.6824.
Hogenesch JB, Ching KA, Batalov S, Su AI, Walker JR, Zhou Y, Kay SA, Schultz PG, Cooke MP: A comparison of the Celera and Ensembl predicted gene sets reveals little overlap in novel genes. Cell. 2001, 106: 413-415. 10.1016/S0092-8674(01)00467-6.
Istrail S, Sutton GG, Florea L, Halpern AL, Mobarry CM, Lippert R, Walenz B, Shatkay H, Dew I, Miller JR, Flanigan MJ, Edwards NJ, Bolanos R, Fasulo D, Halldorsson BV, Hannenhalli S, Turner R, Yooseph S, Lu F, Nusskern DR, Shue BC, Zheng XH, Zhong F, Delcher AL, Huson DH, Kravitz SA, Mouchard L, Reinert K, Remington KA, Clark AG, et al: Whole-genome shotgun assembly and comparison of human genome assemblies. Proc Natl Acad Sci USA. 2004, 101: 1916-1921. 10.1073/pnas.0307971100.
Huson DH, Halpern AL, Lai Z, Myers EW, Reinert K, Sutton GG: Comparing assemblies using fragments and mate-pairs. Proceedings of the Algorithms in Bioinformatics: First International Workshop, WABI 2001: 28-31 August 2001; Aarhus, Denmark. Edited by: Gascuel O, Moret BME. 2001, Berlin/Heidelberg: Springer-Verlag, 2149: 294-306. [Lecture Notes in Computer Science]
Schatz MC, Phillippy AM, Shneiderman B, Salzberg SL: Hawkeye: an interactive visual analytics tool for genome assemblies. Genome Biol. 2007, 8: R34. 10.1186/gb-2007-8-3-r34.
Lindblad-Toh K, Wade CM, Mikkelsen TS, Karlsson EK, Jaffe DB, Kamal M, Clamp M, Chang JL, Kulbokas EJ, Zody MC, Mauceli E, Xie X, Breen M, Wayne RK, Ostrander EA, Ponting CP, Galibert F, Smith DR, DeJong PJ, Kirkness E, Alvarez P, Biagi T, Brockman W, Butler J, Chin CW, Cook A, Cuff J, Daly MJ, DeCaprio D, Gnerre S, et al: Genome sequence, comparative analysis and haplotype structure of the domestic dog. Nature. 2005, 438: 803-819. 10.1038/nature04338.
Mikkelsen TS, Wakefield MJ, Aken B, Amemiya CT, Chang JL, Duke S, Garber M, Gentles AJ, Goodstadt L, Heger A, Jurka J, Kamal M, Mauceli E, Searle SM, Sharpe T, Baker ML, Batzer MA, Benos PV, Belov K, Clamp M, Cook A, Cuff J, Das R, Davidow L, Deakin JE, Fazzari MJ, Glass JL, Grabherr M, Greally JM, Gu W, et al: Genome of the marsupial Monodelphis domestica reveals innovation in non-coding sequences. Nature. 2007, 447: 167-177. 10.1038/nature05805.
Bartels D, Kespohl S, Albaum S, Druke T, Goesmann A, Herold J, Kaiser O, Puhler A, Pfeiffer F, Raddatz G, Stoye J, Meyer F, Schuster SC: BACCardI - a tool for the validation of genomic assemblies, assisting genome finishing and intergenome comparison. Bioinformatics. 2005, 21: 853-859. 10.1093/bioinformatics/bti091.
Dew IM, Walenz B, Sutton G: A tool for analyzing mate pairs in assemblies (TAMPA). J Comput Biol. 2005, 12: 497-513. 10.1089/cmb.2005.12.497.
Zimin AV, Smith DR, Sutton G, Yorke JA: Assembly reconciliation. Bioinformatics. 2008, 24: 42-45. 10.1093/bioinformatics/btm542.
Arner E, Tammi MT, Tran AN, Kindlund E, Andersson B: DNPTrapper: an assembly editing tool for finishing and analysis of complex repeat regions. BMC Bioinformatics. 2006, 7: 155. 10.1186/1471-2105-7-155.
Tammi MT, Arner E, Britton T, Andersson B: Separation of nearly identical repeats in shotgun assemblies using defined nucleotide positions, DNPs. Bioinformatics. 2002, 18: 379-388. 10.1093/bioinformatics/18.3.379.
Kim S, Liao L, Tomb JF: A probabilistic approach to sequence assembly validation. Proceedings of the ACM SIGKDD Workshop on Data Mining in Bioinformatics (BIOKDD'01): 26 August 2001; San Francisco. Edited by: Zaki MJ, Toivonen H, Wang JT. 2001, New York: ACM, 38-43.
Kurtz S: A time and space efficient algorithm for the substring matching problem. Technical Report. 2003, Universität Hamburg, Zentrum für Bioinformatik
Benson G: Tandem repeats finder: a program to analyze DNA sequences. Nucleic Acids Res. 1999, 27: 573-580. 10.1093/nar/27.2.573.
Ewing B, Green P: Base-calling of automated sequencer traces using phred. II. Error probabilities. Genome Res. 1998, 8: 186-194.
Churchill GA, Waterman MS: The accuracy of DNA sequences: estimating sequence quality. Genomics. 1992, 14: 89-98. 10.1016/S0888-7543(05)80288-5.
Kurtz S, Phillippy A, Delcher AL, Smoot M, Shumway M, Antonescu C, Salzberg SL: Versatile and open software for comparing large genomes. Genome Biol. 2004, 5: R12. 10.1186/gb-2004-5-2-r12.
Delcher AL, Phillippy A, Carlton J, Salzberg SL: Fast algorithms for large-scale genome alignment and comparison. Nucleic Acids Res. 2002, 30: 2478-2483. 10.1093/nar/30.11.2478.
Gusfield D: Algorithms on Strings, Trees, and Sequences: Computer Science and Computational Biology. 1997, New York: Cambridge University Press
AMOS: A Modular Open-Source Assembler. [http://amos.sourceforge.net]
Salzberg SL, Church D, DiCuccio M, Yaschenko E, Ostell J: The genome Assembly Archive: a new public resource. PLoS Biol. 2004, 2: E285. 10.1371/journal.pbio.0020285.
Batzoglou S, Jaffe DB, Stanley K, Butler J, Gnerre S, Mauceli E, Berger B, Mesirov JP, Lander ES: ARACHNE: a whole-genome shotgun assembler. Genome Res. 2002, 12: 177-189. 10.1101/gr.208902.
Jaffe DB, Butler J, Gnerre S, Mauceli E, Lindblad-Toh K, Mesirov JP, Zody MC, Lander ES: Whole-genome sequence assembly for mammalian genomes: Arachne 2. Genome Res. 2003, 13: 91-96. 10.1101/gr.828403.
Huang X, Wang J, Aluru S, Yang SP, Hillier L: PCAP: A whole-genome assembly program. Genome Res. 2003, 13: 2164-2170. 10.1101/gr.1390403.
PHRAP documentation: ALGORITHMS. [http://bozeman.mbt.washington.edu/phrap.docs/phrap.html]
Mullikin JC, Ning Z: The phusion assembler. Genome Res. 2003, 13: 81-90. 10.1101/gr.731003.
Margulies M, Egholm M, Altman WE, Attiya S, Bader JS, Bemben LA, Berka J, Braverman MS, Chen YJ, Chen Z, Dewell SB, Du L, Fierro JM, Gomes XV, Godwin BC, He W, Helgesen S, Ho CH, Irzyk GP, Jando SC, Alenquer ML, Jarvie TP, Jirage KB, Kim JB, Knight JR, Lanza JR, Leamon JH, Lefkowitz SM, Lei M, Li J, et al: Genome sequencing in microfabricated high-density picolitre reactors. Nature. 2005, 437: 376-380.
Assembly Alignment Annotation of 12 related Drosophila species. [http://rana.lbl.gov/drosophila/virilis.html]
The MUMmer Homepage. [http://mummer.sourceforge.net]
Blakesley RW, Hansen NF, Mullikin JC, Thomas PJ, McDowell JC, Maskeri B, Young AC, Benjamin B, Brooks SY, Coleman BI, Gupta J, Ho SL, Karlins EM, Maduro QL, Stantripop S, Tsurgeon C, Vogt JL, Walker MA, Masiello CA, Guan X, Bouffard GG, Green ED: An intermediate grade of finished genomic sequence suitable for comparative analyses. Genome Res. 2004, 14: 2235-2244. 10.1101/gr.2648404.