# One Estimator to Rule Them All: xxFIP – Part 1: xK%

xFIP is already an awesome stat. It improves upon FIP in the notion that HR/FB in pitchers is not a skill. So xFIP is calculated as:

If you have read my article on my other ERA Estimator TIPS, you would have read about my goal to eliminate catcher framing and umpire bias from pitchers’ stats. This skill of framing should be attributed to the catcher, not the pitcher.

How can we take a good ERA Estimator like xFIP, and eliminate framing and bias? We need inputs that are not affected by framing. K’s and BB’s are the only stats affected by framing, and are both used. What needs to be done?

Expected K’s and BB’s are needed, and they need to be independent of catcher framing. K% and BB% are better to come up with expected rates for than K/IP and BB/IP since they are per batter faced, instead of per inning. There are already good xK% and xBB% formulae, but they use actual strikes and balls. Strikes and balls are affected by the catcher, so we need new xK% and xBB%s. xHR/FB will also be explored.

Today we will be determining the xK% portion of our xxFIP formula. But first, just as a little teaser into the coming parts of this series. Here are the top 10 pitchers by xxFIP in 2013:

Rank | Name | xxFIP |
---|---|---|

1 | Jose Fernandez | 3.00 |

2 | Matt Harvey | 3.02 |

3 | Anibal Sanchez | 3.06 |

4 | A.J. Burnett | 3.11 |

5 | Justin Masterson | 3.15 |

6 | Chris Sale | 3.16 |

7 | Cliff Lee | 3.20 |

8 | Francisco Liriano | 3.21 |

9 | Clayton Kershaw | 3.25 |

10 | Yu Darvish | 3.30 |

Also, why do we care about xxFIP? What good does it do us? Well, after some preliminary tests, both out of sample (2012-2013) and in sample (2008-2012) xxFIP proves to be the most predictive estimator throughout the season. It is only beaten in the very short term by the aforementioned TIPS. But these results will be saved for later on in this series.

For xK% (and xBB%), I wanted it to change on every pitch. I also wanted it to not have a constant, and take into account every event, so that every pitch outcome (like every on-base event in wOBA) has a weight. I broke down pitches into 5 outcomes that I believed should have a role in xK%.

The first one is **BIP(Balls In Play)/Pitch**. A ball in play completely eliminates the chance of a strikeout right? BIP/Pitch is also a skill, and has correlation from year to year. It is also un-affected by the catcher. Check! You can calculate BIP/P yourself by adding ground balls, line drives, and flyballs together and dividing by the total number of pitches.

The next possible outcome on a pitch is a swinging strike. This is commonly known as **SwStr%**. It is the percent of pitches that are swung at and missed. This is obviously very important in in striking out hitters, and it is a skill for the pitchers.

The next outcome that could happen that relates is a foul ball. **Foul/Pitch **is a skill year to year and a foul is a strike! Fouls are not as helpful as swinging strikes however, as you can’t strike out on a foul ball. That is why these outcomes are separated.

The only other possibility is that a pitch is not swung at, but is it important where that pitch is? Yes it is, if it’s in the zone it should be a strike (good), if it’s outside the zone it should be a ball (bad). These will be called **ZL(Zone-Looking)/Pitch** and **OL(O-Zone Looking)/Pitch. **Shouldn’t ZL/Pitch be the same as a swinging strike? You can strike out on both right? Yes a looking strike should have the same value as a swinging strike, however a pitch in the zone is not always called a strike. On average, a pitch in the zone is called a ball around 14% of the time. This means ZL/Pitch should be less valuable than SwStr%. OL/Pitch is the only outcome left. You may think more balls on a pitch means less strikeouts, but the more balls thrown means more strikeouts. Why? Any pitch that is not put in play increases the chance of a strikeout, and a pitch watched out of the zone keeps the count alive, unless there are already 3 balls. Throwing a ball is not near as harmful as having the ball hit in play. There is also the fact that 7% of pitches outside of the zone are called for strikes.

I ran tests on a sample of pitchers from 2008-2012 who threw at least 200 innings. I excluded 2013 so that I would have a sample to test on that was out of the sample in which the formula was created. Here is the formula for xK% that was determined:

**(-0.98*BIP + 1.3*SwStr + 0.43*Fouls + 0.72*ZL + 0.11*OL)/Pitches**

There is no constant for the equation and it can be used for every year. Realistically the weights would change slightly from year to year as wOBA does, but the change is probably negligible. As you can see, the weights for each event make sense. BIP is the most harmful, while a swinging strike is the most beneficial since it is always a strike. ZL is next and more beneficial than Fouls just because you can’t strike out on a foul, but can on a ZL. OL is positive, but is near zero.

Here is the graph on the sample:

That R^2 is very large. It is larger than our xK% source that used actual strike events. Podhorzer looked at seasons with a 50IP min however, where my sample looks at at least 200 IP. Let’s see what the R^2 is in an in-sample season (say 2011) with a 50IP min is.

Wow! The R^2 is still superior to Podhorzer’s xK% even with the same IP limit. Now what about an out of sample test (2013) with at least 50IP.

Even out of the sample in which the formula was created, we see a very high R^2. This xK% is clearly the best xK% at evaluating current K%, but what about future K%?

I would expect it to do very well in predicting future K%, even better than K% itself. This is because it eliminates all other factors and only has inputs that the pitcher can control and are a skill. Every other xK% created before has never been able to beat out K% in predictability. Let’s look at the out of sample 2013 to see the predictability vs. K%.

Here is a quick and easy chart. N is the number of pitchers in the sample. TBF is total batters faced. RMSE is root mean square error and is a measure of how close each 2012 K% is to the 2013 K%. Lower RMSE is better.

Here xK% seems to be better until 180 IP. The reason that K% may be better after this is that starters remain with the same catcher who has the same framing. Some pitchers may also get a more constant bias from the umpires, but before we jump to these assumptions, here’s 2011-2012:

Now xK% is always better at predicting K% than K% is. Just remember that this is in-sample however, but it would not be foolish to say that xK% is better than K% at predicting future K% in any in-season sample.

This xK% is by far and away the best xK% to date. I have not seen one with a higher R^2, or even one that can beat K% in predicting the future!

Here are the starters in xK% in 2013 with min 150IP:

Rank | Name | xK% |
---|---|---|

1 | Yu Darvish | 0.281 |

2 | Anibal Sanchez | 0.26 |

3 | Max Scherzer | 0.259 |

4 | Jose Fernandez | 0.258 |

5 | Matt Harvey | 0.248 |

6 | Madison Bumgarner | 0.241 |

7 | A.J. Burnett | 0.236 |

8 | Chris Sale | 0.236 |

9 | Stephen Strasburg | 0.233 |

10 | Francisco Liriano | 0.232 |

11 | Justin Masterson | 0.229 |

12 | Ubaldo Jimenez | 0.229 |

13 | Scott Kazmir | 0.228 |

14 | Clayton Kershaw | 0.227 |

15 | Tim Lincecum | 0.226 |

16 | Justin Verlander | 0.225 |

17 | Shelby Miller | 0.225 |

18 | Cole Hamels | 0.224 |

19 | Homer Bailey | 0.219 |

20 | Gio Gonzalez | 0.218 |

21 | Lance Lynn | 0.218 |

22 | Felix Hernandez | 0.217 |

23 | Cliff Lee | 0.217 |

24 | Matt Moore | 0.216 |

25 | Erik Bedard | 0.215 |

26 | Jeff Samardzija | 0.213 |

27 | Dan Straily | 0.209 |

28 | Julio Teheran | 0.208 |

29 | Chris Tillman | 0.205 |

30 | Ian Kennedy | 0.204 |

31 | Mike Minor | 0.203 |

32 | Ryan Dempster | 0.203 |

33 | Ricky Nolasco | 0.201 |

34 | Zack Greinke | 0.201 |

35 | C.J. Wilson | 0.199 |

36 | Mat Latos | 0.198 |

37 | Matt Garza | 0.196 |

38 | Bud Norris | 0.195 |

39 | Patrick Corbin | 0.195 |

40 | Felix Doubront | 0.194 |

41 | Kris Medlen | 0.194 |

42 | Hisashi Iwakuma | 0.193 |

43 | Derek Holland | 0.193 |

44 | A.J. Griffin | 0.192 |

45 | James Shields | 0.192 |

46 | Adam Wainwright | 0.192 |

47 | Matt Cain | 0.19 |

48 | Hyun-Jin Ryu | 0.189 |

49 | John Lackey | 0.188 |

50 | Jose Quintana | 0.187 |

51 | Dan Haren | 0.187 |

52 | Juan Nicasio | 0.186 |

53 | R.A. Dickey | 0.186 |

54 | Edinson Volquez | 0.184 |

55 | Ervin Santana | 0.183 |

56 | Jon Lester | 0.181 |

57 | CC Sabathia | 0.18 |

58 | Jered Weaver | 0.178 |

59 | Doug Fister | 0.177 |

60 | Jordan Zimmermann | 0.175 |

61 | Rick Porcello | 0.175 |

62 | Tommy Milone | 0.174 |

63 | Miguel Gonzalez | 0.173 |

64 | Jeremy Hellickson | 0.173 |

65 | Jorge de la Rosa | 0.173 |

66 | Travis Wood | 0.172 |

67 | Andrew Cashner | 0.17 |

68 | Roberto Hernandez | 0.169 |

69 | Dillon Gee | 0.168 |

70 | Dallas Keuchel | 0.167 |

71 | Hiroki Kuroda | 0.167 |

72 | David Price | 0.166 |

73 | Jason Vargas | 0.165 |

74 | Scott Feldman | 0.164 |

75 | Jeff Locke | 0.162 |

76 | Edwin Jackson | 0.161 |

77 | Wade Miley | 0.159 |

78 | Jarrod Parker | 0.156 |

79 | Andy Pettitte | 0.156 |

80 | Paul Maholm | 0.154 |

81 | Yovani Gallardo | 0.153 |

82 | Jhoulys Chacin | 0.149 |

83 | Mike Pelfrey | 0.147 |

84 | Wily Peralta | 0.146 |

85 | Jerome Williams | 0.145 |

86 | Kyle Lohse | 0.144 |

87 | Bartolo Colon | 0.141 |

88 | Bronson Arroyo | 0.138 |

89 | Mark Buehrle | 0.137 |

90 | Eric Stults | 0.133 |

91 | Mike Leake | 0.133 |

92 | Lucas Harrell | 0.131 |

93 | Kyle Kendrick | 0.128 |

94 | Kevin Correia | 0.12 |

95 | Joe Saunders | 0.12 |

96 | Jeremy Guthrie | 0.116 |

*In the next instalment we will look at the best xBB% for pitchers to date!*

December 3, 2013

It will be interesting to see by how much xxFIP beats xFIP as an estimator. The beauty of FIP and xFIP is in their simplicity. You’ve added a lot of complexity to the model, but it may not be parsimonious. Looking forward to parts two and three!

December 4, 2013

“The next outcome that could happen that relates is a foul ball. Foul/Pitch is a skill year to year and a foul is a strike! Fouls are not as helpful as swinging strikes however, as you can’t strike out on a foul ball.”

Unless it’s a bunted foul ball that is a third strike. Regardless, well done!

December 24, 2013

There is also the fact that 7% of pitches outside of the zone are called for strikes.I would think that outside zone pitches that are called strikes would be highly affected by catcher framing and therefore not something that you would want to include.

December 25, 2013

The catcher framing is not included! All pitches thrown out of the zone (no matter what it’s called) are assumed to be worth the same.