Saturday, February 19, 2011

More WAR: Fangraphs vs. Baseball Reference. Roll Up, Place Your Bets, Fight! Fight!

Trevor Cahill 2010 = 4.1 Baseball Reference WAR
James Shields 2010 = -1.3 Baseball Reference WAR (that's minus 1.3 WAR)

That looks more like it (remember Fangraphs had them both at 2.2 WAR).

So why the big difference?

It's all to do with the difference in how the two sites calculate WAR.

Fangraphs uses FIP to calculate WAR. FIP is largely a predictive, rather than descriptive, stat, so Fangraphs pitcher WAR reflects what a pitcher should have done instead of crediting him with what he actually did. This is the FIP formula:

((13*HR)+(3*(BB+HBP-IBB))-(2*K))/IP + constant*

It theoretically removes luck: you'll see no mention of hits given up. In 2010 Trevor Cahill had a very low BABIP and gave up relatively few hits (hence the low ERA) but struck out hardly anyone, so his FIP is rubbish, resulting in a low Fangraphs WAR. (The K:BB ratio fans out there will be excited to see that Ks and BBs are factored into FIP.)

*the constant adjusts FIP to put it on a scale similar to ERA
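For anyone who wants to play along at home, here is a minimal sketch of that formula in Python. The function name, the default constant of 3.10 and the two stat lines are placeholders of mine for illustration, not real 2010 numbers and not necessarily how Fangraphs implements it; the constant changes from season to season. Note that IP has to be a true decimal (196 and two-thirds innings is 196.667, not the box-score 196.2).

```python
def fip(hr, bb, hbp, ibb, k, ip, constant=3.10):
    """FIP as written above: ((13*HR) + (3*(BB+HBP-IBB)) - (2*K)) / IP + constant.

    `constant` is a placeholder default (an assumption); the real value is
    set each season so that league FIP lines up with league ERA.
    """
    if ip <= 0:
        raise ValueError("ip must be positive")
    return (13 * hr + 3 * (bb + hbp - ibb) - 2 * k) / ip + constant


# Two made-up 200-inning pitchers who allow the same number of home runs.
# Hits allowed are not an input at all, so a low BABIP cannot help either one.
low_k = fip(hr=20, bb=60, hbp=5, ibb=2, k=120, ip=200.0)
high_k = fip(hr=20, bb=40, hbp=5, ibb=2, k=200, ip=200.0)

print(f"low-strikeout pitcher FIP:  {low_k:.2f}")
print(f"high-strikeout pitcher FIP: {high_k:.2f}")
```

The low-strikeout pitcher comes out more than a run worse, however few hits he happened to give up, which is the Cahill situation in miniature.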

In contrast, Baseball Reference WAR is calculated from the number of runs a pitcher allows. As adjustments are made for whether he gives up more or fewer runs than the average pitcher playing in front of his defence, Baseball Reference WAR gives a pitcher credit for 'luck' in preventing hits and runs, penalizes him for giving them up, and is therefore more descriptive than Fangraphs WAR. It also takes into account the quality of opposition faced, whereas Fangraphs tends to mock the idea that putting up good stats in the AL East rather than the AL West should weigh into Cy Young discussions.

There is a discussion on Fangraphs here, justifying their approach - both are valid, I guess, but it seems perverse to credit batters for luck, but penalize pitchers for luck when calculating what is supposed to be the same stat.

To be honest, if I were going to introduce a stat called 'Wins Above Replacement' and calculate it on a year-by-year basis, I'd credit a player for his actual performance that year, not what he should've done if he hadn't been lucky/unlucky. But as it is, both Fangraphs and Baseball Reference WAR can be useful. If you want to know what a pitcher actually contributed in a given year, use Baseball Reference WAR. If you want to know what they are likely to do this year (perhaps more fantasy relevant) use Fangraphs WAR - if FIP is useful for calculating future performance. We'll cover how to predict future performance in some exciting upcoming posts - with fancy graphs and original analysis.

And finally, the WAR-related video of the day is an '80s classic:

Sunday, February 13, 2011

Quality Starts > Wins

Rob Neyer on Quality Starts:

http://www.sbnation.com/mlb/2011/2/13/1991203/running-down-quality-starts
in this 2007 piece at Baseball Prospectus -- that I reported that in 2005 all the QS added up to a 2.04 ERA ... and the non-QS, 7.70. Not that Quality Starts mean anything. Not at all.
In other science/baseball news, Josh Beckett marries a Rocket Scientist. Maybe she can help him better locate his pitches, or alternatively track the trajectories of all the home runs he gives up?

And truly, this is a Quality Start:

Wednesday, February 9, 2011

WAR - What is it good for?

Last season in baseball... two pitchers of equal 'worth'. I give you Fangraphs' 69th and 70th (of 92) most valuable (by WAR) qualified starting pitchers of 2010:

James Shields - 2.2 Fangraphs WAR but a disastrous fantasy starter - 5.18 ERA, 1.46 WHIP (#99 in our league)

Trevor Cahill - 2.2 Fangraphs WAR and a stud fantasy starter - 2.97 ERA, 1.11 WHIP, i.e. quite excellent (#18 in our league).

Surely some mistake? How can this be?


Cahill's K:BB ratio was 1.87, Shields' 3.67 - could this help us answer our conundrum?

To be continued..................

And open for comments

Wednesday, February 2, 2011

And we are back.......

The ESPN Fantasy Baseball Website is Live. Some 'minor' rule changes for 2011:

No Keepers
Transaction Limit of 150
Wins are no more
Avg and BB are merged into OBP

For reasons behind these benevolent decisions, and reasoned debate, go here