So the mission is to classify the distribution of numbers such that they are separated into Hot, Cold and Neutral.
The observation is that when looking at a 7 draw window, the chances are high that a draw consisting of all neutral will be present at least once. The observation also shows that some (but not many) 7 draw windows have NO NNN draws. These draw windows have a common thread! The neutral count for at least 1 column was 5! This is something I had been looking for for a long time... an indicator of when NOT to play! And it is firmly based in statistics because of the hard limit of the standard deviation in each column's distribution.
When running the program on the last 150 draws with no window offset, that is the data I would be making a selection from... one thing I wrote was a quick evaluation of the counts per column. When a neutral count of 5 might appear for any of the columns, it looks to be a better bet to skip a play for that game for that week... because the system trains and evaluates individual columns at a time, this seems to also hold on pick 5 data as well.
Naturally, going through the 2,400 iterations took some time... and the most extreme event happened exactly once where all 7 draws were NNN. But... with NNN accounting for 1/4 of all draws, the distribution across all 2,400 iterations was greater than 90%...
That means for most cases, the full combo set can be reduced from 1,000 to about 300 and still not throw out at least one winning combo in that week.
So it looks like the hypothesis can be accepted, that when considering the raw frequency of a short number of draws (150, in pick 3 AND pick 5), the group of numbers possible can be safely reduced to have at least one winner contained in that reduced set within a week.
Today I am updating all of the draw histories and running the back test on the PA pick 3 Evening again, the PA pick 3 Mid day, the PA pick 5 Evening and the PA pick 5 mid day. This will take hours, but I will have a clearer picture and hard numbers at the end.
Then the hard work begins to try and match other indicators to percent ranges to narrow down to one pick per game with the best chance of a match. Who knows how long that will take...
The cool part is I have a clear indicator of when NOT to play, so the actual attempt will be even cheaper.
When the counts of neutrals per column are all greater than 5, the most expensive week will be a $1 straight per day per game, so $28 to take a shot at the day and night variants of both the pick 3 and the pick 5. The start, however will be 0.50 straight wagers on just the pick 3 games, with a max cost of $7 per week and a shot at $250 per win... will roll in the pick 5 when on "house" money.
Getting ahead of myself again, the process of selecting the correct N number from each column is still going to be a daunting task.
But, I did prove my hypothesis and took the first step to proving random numbers over a short time tend to fill from the middle of the frequency range rather than the extremes.
From the massive output files, I plan on isolating just the NNN draws and begin searching for common threads in the frequency percentages. I still have indicators such as the actual standard deviation and the quartile distribution to use. When we move from number range that provides a safe reduction to a single pick, we move back into the "best guess" territory... and that has always been the case regardless of the system... best guess at a straight hit playing only one combo.
The tweaks to make it fit a jackpot style game will be substantial, mostly the big change is to the expectancy, which will be different for each game type. (6/49, 5/60, 5/69 and 5/70 all have different expectancies, bonus balls will need a different stand alone version of the script as their expectancies differ as well)
The pick 3 will be the first target, but I have always maintained that a win of $50,000 in a single game (such as pick 5) and I would go from standard membership here to platinum. So if you see that change, then you will know this worked!
Happy Coding!