Wednesday, 25 April 2007

Wednesday 25th April

Started preprocessing data by converting two data fields (side and sector) into one (quadrant). Bit of a pain updating 285 records. Still it's a start.

Saturday, 21 April 2007

Saturday 21/04/07

Spent some time analysing the data for trends to develop rules. There are some that stand out e.g. the higher the histological grade the greater the chances of recurrence. This is out all I've done lately but I'm not worried as I there's a some room in the schedule at the moment.

It's been a really good day as I recieved my TMA result and was shocked at how good it was. I would have been happy with 50%. Anyway I mustn't let it go to my head. West Ham won too so great.

Tuesday, 3 April 2007

Tuesday 3rd April

Completing TMA01. Have drawn up a schedule for the project because I had to for question 1. I should have done this earlier that way I may not have spent so much time chosing the dataset and may have made more progress in deciding what I would actually do. Hence I have been rather vague about the KBS because I should have spent more time on this.

Added link for background info about breast cancer prognosis

Sunday, 1 April 2007

Week beginning 25th March

Spent time at work extracting breast cancer data from WinPath. I spoke to staff from the breast care team to see if I can find out about patient outcomes from those who had histology results. I have a couple of contacts to follow up.

I have decided to try to use the data from work if I can get the clinical outcomes because I always have the breast cancer data that I have down loaded as a backup if this isn't achieveable. I will have to be careful not to use patient identifiable data because of the Data Protection Act. Therefore I won't be able to use a date of birth field but age will be OK.

Downloaded several papers on breast cancer prognosis and artificial intelligence via IEEE Xplore. I am reading through these but generally they have used neural networks with some success and fuzzy logic in hybrid systems. I would like to use neural networks in a similar way although, providing I can use the data from work, I will use different prognostic indicators such as oestrogen and progesterone receptor status and her2 status which were not included in these papers. I will try training a multilayer perceptron and comparing this with a radial basis function network and/or Kohonen self organising network to look at supervised vs unsupervised learning. I will try using an one of these network that use unsupervised learning to cluster the data coupled to a MLP to classify the data clusters.

I will try using a rule based system to compare it's performance with that of the neural networks. I will try using fuzzy logic to produce rules based on analysing the data I have. For example: If number of lymph nodes is high and the tumour is large then the chances of reoccurence if high.

Now to sort out that TMA.

About Me

My goal in life is to become grumpier. There's no point getting older unless you become grumpier. Working for the NHS helps as does supporting West Ham, so one day I'll end up like Victor Meldrew.