Quoting comorin (Reply 12):
I would go with Chaos Theory when dealing with systems of high complexity. Otherwise I could set up a machine learning program and train it with thousands of variables to predict the safety of each flight |
The difficulty here is nothing to do with chaos theory.
The problem with applying machine learning to flight safety data is getting enough decent training data. There are a large number of flights, but almost all of them pass off without incident.
On top of this, human beings are already working very hard to spot the patterns in aircraft safety data, and we probably do a fairly good job of it, and therefore change the patterns as we find them.
To give a purely hypothetical example, say a particular engine has a design flaw, such that it explodes if aircraft suddenly goes from very humid air to very dry air. Given say, a million flights in which a thousand resulted in crashes because of this defect, a machine learning algorithm would definitely be capable of spotting the pattern "Aircraft with this kind of engine, flying through areas where this kind of weather often happens are less safe".
But of course, that data would never be collected from the real world. Humans will figure out the design flaw very quickly, perhaps after the first explosion, and fix the design flaw so it doesn't happen. So the pattern will get squashed before it is sufficiently strong for the machine-learning algorithm to detect. So, the machine learning algorithm would see, perhaps, one or two examples of the engine exploding in this kind of weather, and then thousands of examples of the engine working fine in the same situation, and it would do the algorithmic equivalent of shrugging it's shoulders and saying "Those crashes were probably outliers, there's nothing interesting here".
There probably are a few patterns that machine learning would find, but they'd most likely be things we know already: long flights are less safe than short flights, flights through bad weather are less safe, flights on old aircraft are less safe, flights operating in countries that have less rigorous safety standards are less safe, flights with inexperienced pilots are less safe, etc. etc.
Disclaimer: I know nothing about aviation, and I didn't finish my PhD in machine learning (but I picked up a few things along the way)