(Not exactly sure if this belongs here or in r/MathHelp, but I’m posting here because I think there’s too much of a conceptual gap for them to be of much help given their ruleset; I don’t think I’ve shown enough of an initial attempt.)
For example… Let’s say I have data that looks like this:
The top data row contains the average record highs (in °F), i.e. the highest temperature recorded in a typical month and in a typical year, and the bottom row contains the average highs over those same periods, all for the weather station 4 miles southwest of Midway International Airport in Chicago (111577), over its entire period of record. As you can see, July has the highest average monthly record high, but other months occasionally produce the hottest temperature of the year (in 17 of the 41 years on record, in fact, with May, June, August, and September all managing it at least once), so the average yearly record high ends up somewhat higher.
Whoops! *crashing sounds* *boom*
I lost the average annual record high! But I still want to use the data! What can I do?
Now, at least if we know or assume a statistical distribution (say a normal distribution, which daily temperatures follow only very roughly), we can calculate the standard deviation of the daily highs in each month. Statistically, the average monthly record high is just the temperature below which (n − 1)/n of the daily-high distribution lies, where n is the number of days in the month, so anywhere from 27.25/28.25 (February) to 30/31 (the 31-day months):
(Top data row is the difference between the average record highs and average highs, while the bottom data row is the calculated standard deviation.)
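The standard-deviation step described above can be sketched roughly as follows. This is only an illustration under the same normality assumption: the function name `daily_high_sigma` and the example temperatures are hypothetical (they are not the station's values), and it uses `statistics.NormalDist` from the Python standard library.

```python
from statistics import NormalDist


def daily_high_sigma(avg_record_high, avg_high, n_days):
    """Estimate the standard deviation of daily highs for one month.

    Assumption: daily highs are normally distributed, and the average
    monthly record high sits at the (n - 1)/n quantile of that distribution.
    """
    # Z-score below which (n - 1)/n of a standard normal curve lies.
    z = NormalDist().inv_cdf((n_days - 1) / n_days)
    # The gap between record high and average high equals z standard deviations.
    return (avg_record_high - avg_high) / z


# Hypothetical numbers for a 31-day month, purely for illustration.
sigma = daily_high_sigma(avg_record_high=95.0, avg_high=84.0, n_days=31)
print(f"estimated sigma: {sigma:.2f} °F")
```

For February, passing `n_days=28.25` reproduces the 27.25/28.25 fraction mentioned above.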
This can in turn be used to roughly estimate the probability that a given month will exceed the average record high for July (which is not necessarily the actual July record high of the year in which that happens):
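| Jan | Feb | Mar | Apr | May | Jun | Jul | Aug | Sep | Oct | Nov | Dec |
|---|---|---|---|---|---|---|---|---|---|---|---|
| 1 in ~12.1 Myr | 1 in ~147 Kyr | 1 in ~151 years | 1 in ~22 years | 1 in ~6–7 years | ~4–5 in 10 years | N/A | ~4–5 in 10 years | 1 in ~5 years | 1 in ~50 years | 1 in 5.95 Kyr | 1 in ~393 Kyr |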
(Top data row is the maximum temperature anomaly required to exceed the average July record high, the second row is the Z-score of that anomaly, and the third row is the estimated period of return of that anomaly assuming a normal distribution with no skewness [again, not exactly realistic, but ehh.])
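One possible way to turn a Z-score into a return period is sketched below. It is not necessarily how the figures above were computed: the function name `monthly_exceedance` and the example inputs are hypothetical, and the sketch additionally assumes that daily highs within a month are independent, which real, autocorrelated weather is not.

```python
from statistics import NormalDist


def monthly_exceedance(threshold, avg_high, sigma, n_days):
    """Return (z, probability that at least one day in the month exceeds threshold).

    Assumptions: daily highs are normal with mean avg_high and standard
    deviation sigma, and the days are independent of one another.
    """
    # Anomaly needed to reach the threshold, in standard deviations.
    z = (threshold - avg_high) / sigma
    # Upper-tail probability for a single day; cdf(-z) equals 1 - cdf(z)
    # by symmetry and stays accurate for very small tails.
    p_day = NormalDist().cdf(-z)
    # Chance that at least one of the month's days exceeds the threshold.
    p_month = 1.0 - (1.0 - p_day) ** n_days
    return z, p_month


# Hypothetical inputs: threshold stands in for the average July record high.
z, p = monthly_exceedance(threshold=95.0, avg_high=75.0, sigma=8.0, n_days=30)
print(f"z = {z:.2f}, about 1 in {1 / p:.1f} years")
```

The reciprocal of the monthly probability is the return period in years, since each calendar month occurs once per year.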
But I’m unsure on how to go any further, if doing so is even possible. Anyone have any insights?
(And yes, I know that it is very rare for average monthly extremes to be provided without either average yearly extremes or their immediate precursors [extremes for individual years/months], but I am dealing with just such an instance…)