W. B. Langdon and B. F. Buxton. Genetic Programming for Mining DNA Chip data from Cancer Patients. Genetic Programming and Evolvable Machines, 5(3):251-257, 2004. http://gpbib.cs.ucl.ac.uk/gp-html/langdon_2004_GPEM.html https://doi.org/10.1023/B:GENP.0000030196.55525.f7 \cite{pomeroy:2002:nature} "Gene Expression-Based Classification and Outcome Prediction of Central Nervous System Embryonal Tumors" data was copied from http://www-genome.wi.mit.edu/mpr/publications/projects/CNS/Pomeroy_et_al_0G04850_11142001_datasets.zip Gene descriptors and patient identification were removed from Dataset_C_MD_outcome.gct and Dataset_C_MD_outcome.xls, which were then merged and transposed. There are 7129 signed integer gene expression values for each of the 60 patients, of whom 39 survived. Oct 2021 Original mit.edu zip file gone. This directory has the data files I used in the GP+EM, 2004, article. Dataset_C_MD_outcome.trn transposed 60 x 7129 + outcome Dataset_C_MD_outcome.403 403 selected features + outcome Dataset_C_MD_outcome.2 2 selected features + outcome WBL