Sequence generation for assembly. For this project we are generating fairly high coverage in a number of different insert sized libraries. The assembly strategy is based around a seed allpaths assembly (the Broad Allpaths assembler) followed by seed assembly improvement using homegrown tools, Atlas-link and Atlas-GapFill, which can significantly improve the results. Thus we generate sequence data to enable the Allpaths assembly. As of Nov 2011 this is: - 40X genome coverage in 180bp insert library (100bp reads forward and reverse); and 40X 3kb insert data. To enable better scaffolding and local gap filling we additionally generate 500bp, 1kb, 2kb, and 8kb insert sizes at > 20X coverage.
Please cite https://doi.org/10.15482/USDA.ADC/1173238
Source: Baylor College of Medicine i5K Project Summary