ENDNOTES | Data Mining: Opportunities and Challenges

Chapter IV - Feature Selection in Data Mining
Data Mining: Opportunities and Challenges
by John Wang (ed)
Idea Group Publishing 2003

Continuous objective functions are discretized.
This is one of main tasks in the 2000 CoIL challenge (Kim & Street, 2000). For more information about CoIL challenges and the data sets, please refer to http://www.dcs.napier.ac.uk/coil/challenge/.
If other objective values are equal, we prefer to choose a solution with small variance.
This is reasonable because as we select more prospects, the expected accuracy gain will go down. If the marginal revenue from an additional prospect is much greater than the marginal cost, however, we could sacrifice the expected accuracy gain. Information on mailing cost and customer value was not available in this study.
The other four features selected by the ELSA/logit model are: contribution to bicycle policy, fire policy, number of trailer, and lorry policies.
The cases of zero or one cluster are meaningless, therefore we count the number of clusters as K = κ + 2 where κ is the number of ones and K_min = 2 ≤ K ≤ K_max.
For K = 2, we use F_complexity = 0.76, which is the closest value to 0.69 represented in the front.
In our experiments, standard error is computed as standard deviation / iter^0.5 where iter = 5.