I have been spending a lot of time preprocessing the breast cancer data for the neural network. Until this is complete I cannot start testing. I have taken my tutor's advice and tried to reduce the number of input neurons, though I have probably still got too many. The statistical analysis of the data suggested that some attributes have no influence on tumour recurrence. The location of the tumour in the breast, which I took a long time converting into quadrants (and the centre), seems to have no influence on recurrence: broken down by location, there is a roughly 70% / 30% split between non-recurrence and recurrence, which is the same as the overall ratio of non-recurrence to recurrence. Some attributes are also related to each other. For example, menopausal status is age related, so I have removed menopausal status.
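The comparison described above (recurrence rate within each value of an attribute versus the overall rate) could be sketched roughly as follows. This is only an illustration, not the analysis actually used: the function name `recurrence_rates`, the field names `quadrant` and `recurrence`, and the toy rows are all made up for the example and are not taken from the real data set.

```python
from collections import Counter

def recurrence_rates(records, attribute, outcome="recurrence"):
    """Return (overall_rate, {attribute_value: rate}) for one categorical attribute.

    `records` is a list of dicts; `outcome` is a hypothetical boolean field.
    """
    totals = Counter()    # rows seen per attribute value
    recurred = Counter()  # rows with recurrence per attribute value
    for row in records:
        totals[row[attribute]] += 1
        if row[outcome]:
            recurred[row[attribute]] += 1
    overall = sum(recurred.values()) / len(records)
    per_value = {value: recurred[value] / totals[value] for value in totals}
    return overall, per_value

# Made-up toy rows purely to show the call; NOT the real breast cancer data.
toy_rows = [
    {"quadrant": "left_up", "recurrence": True},
    {"quadrant": "left_up", "recurrence": False},
    {"quadrant": "central", "recurrence": False},
    {"quadrant": "central", "recurrence": True},
]

overall, per_value = recurrence_rates(toy_rows, "quadrant")
for value, rate in per_value.items():
    print(f"{value}: {rate:.0%} recurrence (overall {overall:.0%})")
```

If every value of an attribute shows about the same rate as the overall figure, that attribute is a candidate for removal. With small groups the rates will be noisy, so a formal test would be a safer basis than eyeballing the percentages.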
I will now carry on with this preprocessing, which is rather laborious because I am doing it manually. I do not have the expertise to convert the raw data into the format required by the neural network any other way. There are likely to be mistakes in this conversion, but I am trying to be as meticulous as I can.
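For what it is worth, a conversion like this can often be scripted, which removes most of the scope for copying mistakes. The sketch below assumes the network wants one input neuron per category value (one-hot encoding); that is an assumption, as is everything named here: `SCHEMA`, `age_band`, `node_caps` and the category lists are hypothetical placeholders, not the real attributes or the encoding actually used.

```python
# Hypothetical schema: attribute name -> ordered list of allowed values.
# Each allowed value becomes one input neuron.
SCHEMA = {
    "age_band": ["30-39", "40-49", "50-59"],
    "node_caps": ["no", "yes"],
}

def one_hot(value, categories):
    """Return a list with 1.0 at the position of `value` and 0.0 elsewhere."""
    if value not in categories:
        # Fail loudly rather than silently encoding a typo as all zeros.
        raise ValueError(f"unexpected value {value!r}; expected one of {categories}")
    return [1.0 if value == category else 0.0 for category in categories]

def encode_row(row, schema=SCHEMA):
    """Concatenate the one-hot encodings of every attribute in schema order."""
    inputs = []
    for attribute, categories in schema.items():
        inputs.extend(one_hot(row[attribute], categories))
    return inputs

# One made-up record, just to show the shape of the result.
print(encode_row({"age_band": "40-49", "node_caps": "yes"}))
```

Raising an error on an unexpected value is deliberate: it catches the kind of transcription slip that is easy to miss when converting by hand.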