Thanks to my moderate knowledge of statistics, I know that I have a lot more to learn in the field and should never make assumptions about data or analyses (even my own).

Because of this, I share a grievance with Zed Shaw, who says that “programmers need to learn statistics or I will kill them all”. Required reading and advice not just for programmers, but for everyone who looks at data, creates models, or even reads a newspaper.

I have a major pet peeve that I need to confess. I go insane when I hear programmers talking about statistics like they know shit when its clearly obvious they do not. I’ve been studying it for years and years and still don’t think I know anything. This article is my call for all programmers to finally learn enough about statistics to at least know they don’t know shit. I have no idea why, but their confidence in their lacking knowledge is only surpassed by their lack of confidence in their personal appearance.

My recommendation? Read this article to realise that you know nothing, and then pick up a copy of John Allen Paulos’ Innumeracy and Darrell Huff’s How to Lie with Statistics in order to realise that you know even less than you thought (but a hell of a lot more than the average person).