Index of /staff/wlangdon/ftp/gp-code/barracuda_0.7.105/1000
This directory contains randomly chosen paired end short (100bp)
next generation DNA sequences from The 1000 Genomes Project.
ERR239463 sequences were used to train the genetic improvement
of barracuda to give release 0.7.105.
See http://sourceforge.net/projects/seqbarracuda/
The other DNA sequences are for validation. The GI is described in
"Improving CUDA DNA Analysis Software with Genetic Programming",
William B. Langdon and Brian Lam and Justyna Petke and Mark Harman,
GECCO 2015. DOI: http://dx.doi.org/10.1145/2739480.2754652
W.B.Langdon 18 April 2015
----------------------------------------------------------------------
26 Jun 2017
Added 1000 genome dataset ERR001270 (36bp paired end)
Name Last modified Size Description
Parent directory
ERR001270_1.recal.fas> 12-May-15 14:31 575M
ERR001270_2.recal.fas> 12-May-15 14:31 595M
ERR239463_1.million.f> 10-Jun-14 20:29 246M
ERR239463_2.million.f> 10-Jun-14 20:29 246M
ERR239771_1.200k.fastq 28-Jan-15 09:22 49M
ERR239771_2.200k.fastq 28-Jan-15 09:22 49M
ERR239938_1.200k.fastq 28-Jan-15 09:22 49M
ERR239938_2.200k.fastq 28-Jan-15 09:22 49M
ERR241251_1.200k.fastq 28-Jan-15 09:22 49M
ERR241251_2.200k.fastq 28-Jan-15 09:22 49M
ERR242845_1.200k.fastq 28-Jan-15 09:22 49M
ERR242845_2.200k.fastq 28-Jan-15 09:22 49M
ERR251575_1.200k.fastq 28-Jan-15 09:25 49M
ERR251575_2.200k.fastq 28-Jan-15 09:26 49M
README 26-Jun-17 14:43 1K
validation.bat 28-Jan-15 09:19 2K
validation_2.bat 28-Jan-15 09:20 2K
wc_fastq_10-jun-2014 10-Jun-14 18:31 1K
18 files