Blog

An Early, Nerdy Look At The Problem System

1 April 2026

Within the new season’s early going, the problem system has been all the craze throughout the majors. For those who don’t consider me, you may learn ESPN’s protection of it, or The Athletic’s, or MLB.com’s, or … effectively, you get the concept. The protection has been intensive and constructive, and I couldn’t agree with its enthusiasm extra. I really like the brand new system, and I’m additionally actually excited to consider challenges generally. There are such a lot of enjoyable angles to contemplate. So right here’s the mathematics nerd’s tackle what challenges have seemed like thus far, and what I’m most to find out about them transferring ahead.

How I’m Considering About Challenges
Each time a strike or ball known as, there’s a chance for a problem, no less than as long as the related workforce has one remaining. That makes it straightforward to measure the potential worth of a problem on any given pitch: It’s value nonetheless a lot flipping the results of that individual pitch would change the sport state of affairs within the difficult participant’s favor. All we now have to do is determine what number of runs have been prone to rating within the inning in every case and examine the 2.

That sounds laborious, but it surely really isn’t so dangerous. All it’s important to do is assemble an RE288 matrix, which measures what number of runs have scored, on common, after the sport reaches a given mixture of outs, baserunners, balls, and strikes. For instance, over the previous 10 years of main league play, groups have scored 1.02 runs per inning after a batter reaches a 1-0 depend with a runner on third and one out, however utilizing our matrix, we will work out the entire attainable run expectations a batter may attain in that plate look:

Run Expectancy, Runner on Third, One Out

	Strikes
Balls	0	1	2
0	0.98	0.90	0.81
1	1.02	0.94	0.81
2	1.09	0.99	0.88
3	1.19	1.10	0.96

MLB, 2016-present

Now let’s think about a catcher weighing whether or not to problem a known as ball. To find out the worth of profitable problem, we will calculate the change in run expectancy of a ball versus a strike. After all, the state of affairs issues. We’re most excited by these counts the place the end result would end in a stroll or a strikeout, because it’s the distinction between first and third with one out, or a person on third with two outs. Let’s have a look:

Run Worth of a Profitable Problem, Runner on Third, One Out

	Strikes
Balls	0	1	2
0	0.12	0.13	0.43
1	0.15	0.18	0.50
2	0.20	0.22	0.58
3	0.12	0.26	0.84

These are listed in run values, and so they’re are massive numbers. Flipping a 3-2 pitch from a stroll to a strikeout is value a whopping 0.84 runs; it’s the distinction between a jam and a cushty inning. Flipping the primary pitch of an at-bat is way much less impactful. And that’s in an essential spot, with a runner on third and fewer than two outs. Subsequent, think about that our batter hits a sacrifice fly, leaving the bases empty with two outs. The problem values for every pitch within the subsequent batter’s time on the plate are far decrease:

Run Worth of a Profitable Problem, None On, Two Out

	Strikes
Balls	0	1	2
0	0.03	0.04	0.08
1	0.04	0.04	0.09
2	0.06	0.07	0.13
3	0.07	0.10	0.23

Flipping a stroll to a strikeout nonetheless issues, however virtually all the pieces else is low worth. Even in the event you don’t do the mathematics, you understand this intuitively. If an umpire misses a name deep within the depend with a runner on third and one out, it stings. It feels prefer it might be a key turning level within the recreation. If a bases-empty, two-out depend will get to 1-0 when it may have been 0-1, it hardly issues.

You Aren’t a FanGraphs Member

It seems to be such as you aren’t but a FanGraphs Member (or aren’t logged in). We aren’t mad, simply upset.

We get it. You wish to learn this text. However earlier than we allow you to get again to it, we would wish to level out a number of of the great the reason why it’s best to change into a Member.

1. Advert Free viewing! We cannot bug you with this advert, or some other.

2. Limitless articles! Non-Members solely get to learn 10 free articles a month. Members by no means get reduce off.

3. Darkish mode and Basic mode!

4. Customized participant web page dashboards! Select the participant playing cards you need, within the order you need them.

5. One-click knowledge exports! Export our projections and leaderboards in your private initiatives.

6. Take away the images on the house web page! (Truthfully, this does not sound so nice to us, however some individuals wished it, and we like to offer our Members what they need.)

7. Much more Steamer projections! We have now handedness, percentile, and context impartial projections out there for Members solely.

8. Get FanGraphs Stroll-Off, a custom-made yr finish evaluate! Discover out precisely the way you used FanGraphs this yr, and the way that compares to different Members. Do not be a sufferer of FOMO.

9. A weekly mailbag column, solely for Members.

10. Assist help FanGraphs and our total employees! Our Members present us with essential assets to enhance the location and ship new options!

We hope you will take into account a Membership at present, for your self or as a present! And we notice this has been an awfully lengthy gross sales pitch, so we have additionally eliminated all the opposite adverts on this article. We did not wish to overdo it.

Runs, not flipped calls, are the top objective of the problem system. You possibly can win 20 challenges in an 0-0 depend with two outs and the bases empty and nonetheless assist your workforce lower than profitable a single problem in a 3-2 depend with a runner on third and one out. I’m fairly assured that that is the fitting approach to consider it. Tom Tango’s intensive overview of the problem system makes use of the identical methodology. It’s what I got here up with previous to consulting other people on the web site, and it’s additionally what they got here up with earlier than they talked to me. That’s a very good signal that runs are the right foreign money for problem worth.

Why not use win chance and leverage index? Since you don’t get to take your challenges dwelling with you. For those who’re trailing 7-1 within the eighth, leverage index would let you know there’s mainly no play that’s actually value difficult. The win chance good points aren’t that top if you’re virtually positive to lose anyway. That isn’t proper, although. All you are able to do is use the challenges it’s important to add as many runs of worth per recreation as attainable. A run is a run is a run. That doesn’t change relying on whether or not you’re behind or forward, or what inning it’s. For those who’re making an attempt to measure how a lot challenges have impacted a workforce looking back, win chance is an affordable measure. For those who’re making an attempt to find out how gamers ought to behave, run worth is the best way to go.

Catchers Are Higher At Difficult Than Hitters or Pitchers
With that run worth framework in thoughts, we will check out all of the challenges which have occurred thus far (effectively, via March 30) and observe some broad patterns. Via Monday’s motion, groups have issued 227 challenges, 124 of which have been profitable, good for a 54% conversion price. The protection has challenged extra steadily, and it’s principally been catchers; 118 of 124 defensive challenges have come from behind the plate. Hitters have challenged solely 103 instances.

Defenders have been profitable on 57.6% of their challenges, whereas hitters have been profitable on 50.5% of theirs. This tracks with the spring coaching and 2025 minor league knowledge. It’s simply simpler for catchers to evaluate the strike zone. They get to have a look at the ball immediately because it is available in, reasonably than catching a side-on look at it, and so they’re inches away from the plate as a substitute of 60 toes.

It’s value noting that 57.6% just isn’t 100%. Catchers have a greater concept the place the ball crossed the plate, however they clearly don’t have an ideal image. I feel that main leaguers may have already instructed you this. They’re by no means completely positive whether or not or not a pitch was a strike till they return to the dugout and check out their iPad. Why don’t gamers problem each dangerous name? As a result of they don’t at all times know whether or not or not a name is dangerous within the second.

Batters Perceive Run Leverage
I wished to know what makes a participant extra prone to problem, so I dug into the information extra deeply. I took the inhabitants of challengeable pitches within the 2026 season (once more, via March 30) and famous each’s potential problem worth. I cut up the pitches into three buckets, every with a roughly equal complete challengeable achieve. For instance, the 5,984 lowest-leverage problem alternatives carried a complete challengeable achieve of 485 runs, whereas the subsequent 2,571 problem alternatives have been value as much as 524 runs. The 890 highest-importance problem alternatives have been value as much as 441 runs on their very own. (For those who’re questioning why the buckets aren’t completely equal, it’s as a result of I can’t slice issues up infinitely; tons of pitches have the very same worth due to the best way RE288 works.)

If gamers are behaving optimally, you’d count on to see much more challenges within the highest-value bucket. And nice information: That’s precisely what’s occurring thus far. Batters problem extra steadily, even with a decrease accuracy price, in additional essential conditions. They’re much less correct in these essential conditions, which is rational. They’re much less correct as a result of they’re difficult extra. And so they’re difficult extra as a result of the worth of succeeding is sort of excessive:

Batter Challenges By Run Leverage

Run Leverage	Challenges	Alternatives	Problem Fee	Success Fee	Runs Gained Per Problem	Runs Gained
Low	49	2074	2.4%	55.1%	0.05	2.5
Medium	32	689	4.6%	43.8%	0.09	2.8
Excessive	22	216	10.2%	50.0%	0.26	5.8

Now, this knowledge means that hitters don’t have an amazing concept, within the mixture, of whether or not a pitch is within the strike zone. In low-leverage conditions, you wish to be very positive of your self earlier than difficult. You get limitless appropriate challenges, however solely two unsuitable ones. That implies that optimum habits early within the depend or with the bases empty is one thing like “problem if you understand it’s a ball, in any other case put it aside for a much bigger spot later.” And but hitters have been profitable barely greater than half the time. Yikes!

That’s to not say that there aren’t any gimme challenges. Elly De La Cruz has solely challenged one pitch all yr. It got here in a 1-0 depend with a runner on first and two outs, squarely within the “low run leverage” bucket. That’s the form of problem you solely make in the event you’re positive – however De La Cruz was, and he was proper.

After all, there are causes to problem even when there aren’t many (or any) runs on the road. The bottom-value problem by runs added got here on a borderline pitch to Spencer Torkelson in an 0-0, bases empty, two out state of affairs. That sounds dangerous. However it was the ninth inning and Detroit had two challenges remaining. Spend ‘em in the event you’ve bought ‘em in that state of affairs.

Most low-run-leverage challenges are dangerous, although. That’s how you find yourself with such a low success price, and so few runs added per problem. Denzel Clarke challenged this one. Evan Carter challenged this one, and even made an “I’m unsure” face as he did it. Vladimir Guerrero Jr. burned a problem with two out and nobody on within the first inning. These guys are all undoubtedly being instructed by their groups to solely problem in the event that they’re positive. They challenged anyway, in a state of affairs the place the achieve was so small that that they had to make sure they have been proper to have it make sense. It’s simply laborious to know the place the ball crossed the plate!

Catchers Perceive Run Leverage, Too
Catchers face a special worth proposition than hitters. The inhabitants of pitches they technically can problem features a lot that they positively gained’t – curveballs within the filth, fastballs to the backstop, and different miscellaneous non-competitive pitches. Hitters swing at a lot of the pitches they’re positive are within the strike zone, so the inhabitants of pitches they could problem is disproportionately filled with borderline calls relative to catchers. In different phrases, you may’t examine either side’s problem charges one to at least one. However whereas the denominator is completely different, catchers behave equally to hitters, difficult rather more steadily in essential conditions:

Catcher (And Pitcher) Challenges By Run Leverage

Run Leverage	Challenges	Alternatives	Problem Fee	Success Fee	Runs Gained Per Problem	Runs Gained
Low	59	3910	1.50%	62.7%	0.06	3.4
Medium	43	1882	2.30%	58.1%	0.13	5.6
Excessive	22	674	3.30%	45.4%	0.22	4.8

Catchers characterize all however six of the challenges right here.

We already know that catchers do higher than batters general. They problem extra low-importance pitches than batters and succeed at a better price. Additionally they problem extra steadily when profitable a problem is extra invaluable, regardless of it coming with a decrease success price. They nonetheless aren’t excellent, in fact – once more, calling balls and strikes may be very laborious.

Catchers have challenged within the lowest-importance spot – bases empty, two outs, 0-0 depend – 5 instances already. They’ve solely been profitable 60% of the time. Yainer Diaz missed one within the first inning, and it wasn’t even shut. Jonah Heim challenged a pitch that was greater than two inches low, maybe having fooled himself along with his personal body job. This shot of him watching the problem end result reside is extraordinarily relatable:

So catchers nonetheless have some work to do. However it’s clear that they’re usually occupied with this the fitting approach. Problem charges go up and problem success charges go down because the rewards for a profitable problem improve. That’s the way it needs to be. In different phrases, catchers are rational actors, even when they aren’t excellent arbiters of the zone.

Low-Significance Challenges Are Nonetheless Occurring Too Regularly
The precise math behind a breakeven problem success price is hard, and I’m not assured that I’ve solved it but. It’s important to take into account your subjective confidence that you understand whether or not the pitch was a ball or a strike, the reward for a profitable problem, and likewise what number of extra alternatives to problem you may need within the recreation. Given a restricted variety of incorrect challenges however an infinite variety of profitable ones, your certainty modifications the chance of you paying any price in anyway in your problem. I may give you an approximation, although, and I feel it has some clear takeaways.

Think about two hypothetical conditions. In a single, a catcher has a 50/50 shot at flipping a stroll to a strikeout. Since that is hypothetical, let’s say it’s value 0.7 runs to take action. Within the second state of affairs, the catcher is 80% positive that they’ll win a low-value problem, value 0.1 runs if profitable. Right here we’ll say that the worth of 1 unspent problem is 0.1 runs, the typical run worth gained per problem issued thus far. These numbers are all roughly consultant of real-world run values, although the 80% problem success price might be optimistic given what we’ve seen thus far.

The mathematics right here is fairly straightforward. Earlier than the problem within the first state of affairs, the catcher’s workforce had 0.1 runs value of unspent challenges. Fifty p.c of the time, they get 0.7 runs and retain a problem for a complete of 0.8 runs of worth. The opposite 50% of the time, they lose and find yourself with 0 runs of problem worth. The online worth, then, is the distinction between 0.4, the anticipated worth of difficult, and 0.1, the anticipated worth they held earlier than difficult. That problem is value 0.3 runs in expectation, in different phrases.

Now let’s ask one other query: What’s it value to a catcher to problem these 80%, low-importance conditions time and again till they miss? We are able to run the mathematics on that too. Twenty p.c of the time, they get nothing. One other 16% of the time, they win one problem after which lose the subsequent. You possibly can preserve happening the road like that, including 0.1 runs for each profitable problem and fixing all the equation. I went so far as calculating the chances of the catcher hitting 25 challenges in a row. Sum up the anticipated worth of those they win, and also you get that an 80% success price in low-leverage circumstances, repeated infinitely, is value 0.39 runs. On condition that the preliminary unspent problem was value 0.1 runs, that’s a achieve of 0.29 runs of worth. In different phrases, it’s equally worthwhile to problem low-importance calls, even with 80% certainty, as it’s to problem a pure coin flip in an essential spot.

After all, I’m most likely overestimating the worth of that 80% technique. Nobody will get 25 probabilities to problem in a recreation. For those who restrict the catcher to 5 challenges and assume that they’ll simply pocket the unspent problem for later in the event that they hit all 5, that technique is value 0.19 runs in expectation. It’s simply actually laborious to make sure sufficient of a low-importance problem to offset the worth of getting one left when you really want it.

You may discover that each methods have constructive anticipated worth. Why not do each, then? That actually appears affordable to me. In case you have a catcher who can overturn calls with an 80% certainty price, he ought to most likely do this till he loses a problem. However the danger of not having a problem remaining for the high-leverage spots is actual. In finance, we used to name this selecting up pennies in entrance of a steamroller. The chances are good – however the rewards aren’t sufficient to justify the danger.

We Don’t Know Who’s Good But
It’s going to take some time to determine who’s really good at difficult. There aren’t that many observations, and never each remark has equal worth due to the differing run values for various challenges. Successful probably the most challenges isn’t inherently nice. Neither is having the perfect problem profitable proportion. Groups additionally appear to be altering their habits on the fly. It’s going to take a very long time to weed out the perfect from the worst with a lot variance. To make issues much more difficult, there’s positively worth in not difficult at instances.

That mentioned, we will say who has accrued probably the most worth from challenges thus far. It’s Eugenio Suárez. His two challenges – possibly you’ve seen them – have been value a mixed 1.73 runs of added anticipated worth. Kyle Schwarber is available in second with 0.71 runs added. On the catching aspect, Edgar Quero has added probably the most worth, however he’s used a ton of challenges to take action and hasn’t been all that profitable on them. Curiously, just about the entire catchers who’ve accrued probably the most worth have additionally missed a reasonably low-importance problem already:

Prime Catcher Problem Run Values

Participant	Profitable Problem Worth	Chall	Overturn	Runs/Problem	Finest Success	Worst Failure
Edgar Quero	1.3	9	4	0.14	0.46	0.08
Salvador Perez	1.0	5	4	0.21	0.50	0.11
Samuel Basallo	0.8	3	2	0.27	0.44	0.21
Nick Fortes	0.8	5	3	0.16	0.51	0.07
Patrick Bailey	0.8	5	3	0.15	0.51	0.10

It’s Arduous To Measure Certainty
As Tango’s analysis into challenges exhibits, “simply problem those which can be clearly unsuitable” doesn’t describe the fact on the bottom. Catchers in spring coaching challenged simply 35% of pitches that have been three or extra inches contained in the zone and have been known as balls. These are apparent strikes; they’re a strike by greater than a baseball width. These are free! For those who problem them, you get the strike and don’t lose a problem. And but catchers, even in a coaching surroundings the place they have been absolutely inspired to experiment with the brand new system, simply weren’t positive sufficient. Heck, they solely challenged 70% of pitches two-plus inches into the zone in full counts.

In different phrases, you may’t simply take a look at the place the pitch finally ends up and say that each single dangerous name will get cleaned up. Nobody really is aware of the place the ball crossed the plate once they’re difficult. Nobody is aware of the precise bodily location of the strike zone, both. The zone is a theoretical field, not a bodily one, and pitches are transferring so shortly and a lot that batters and catchers are inferring their trajectory reasonably than perceiving the ball constantly all through its flight. Typically, a catcher is bound the ball is approach outdoors and doesn’t problem, however in reality, the ball nicked the zone. Typically a hitter is satisfied a pitch was within the zone when it was really 4 inches low; the alternative occurs too. “Certain factor challenges” are generally really pitches that shouldn’t be challenged.

Challenges Work In Two Important Methods
First, they appropriate some egregious calls early in video games. Now, not each egregious name will get corrected; that’s simply not how this works. However umpires have known as strikes on 31 pitches that crossed the plate 2.5 or extra inches away from the zone this yr, and batters have efficiently challenged 9 of them. Equally, umpires have known as a ball on 11 pitches that have been within the regulation zone by no less than 2.5 inches; defenders have challenged six of them, prevailing every time. The league has reduce egregiously unsuitable calls by someplace between a 3rd and a half.

One other factor challenges do? Make it possible for much more of the highest-importance calls are proper. To date this yr, 624 pitches have been known as a ball when the distinction between a ball and a strike is value a 3rd of a run or extra. Twenty-six of these calls have been unsuitable, however 10 bought challenged and corrected. Batters haven’t executed as effectively, what with not having pretty much as good of a way of the strike zone and all, however 207 pitches have been known as a strike when the distinction between a strike and a ball is value a 3rd of a run or extra. Thirty-eight of these calls have been unsuitable, and hitters challenged and overturned 11 of these, meaningfully decreasing the share of essential calls that get missed.

That’s nice! Egregious misses and high-importance misses are the 2 instances I’m most excited by having a robotic ump appropriate the report. We don’t know lots about problem ability on a person stage but, and I’m not even able to say something about which groups are doing the perfect. Once more, that is very noisy knowledge. However challenges are decreasing the variety of incorrect calls, and so they’re doing so in a predictable and fascinating approach.

I’m excited to proceed studying extra about this technique. My chief takeaways, although, are that it’s doing what I hoped it could, and that followers appear to like it thus far. There are tons of enjoyable analysis questions to contemplate. When catchers bat, are they higher at difficult? How rather more ought to star hitters problem, and the way rather more ought to catchers problem towards star hitters? Does umpire identification change workforce problem habits? What’s an optimum problem technique? How does it change primarily based in your roster? Who’s the perfect at it? Who’s the worst at it?

I don’t know the solutions to any of these questions but. However I hope to seek out out sooner or later. And within the meantime, what’s to not like? Calls are extra correct. Each the in-stadium and on-TV expertise of a challenged pitch have been rousing successes thus far. Gamers appear to be having fun with themselves. I’m not shocked that the problem system is a hit, as a result of it labored within the minors and copies a system that labored effectively in tennis. I’m comfortable that we now have it, although, and I feel that in brief order, everybody will marvel why we didn’t at all times let gamers do that.

Supply hyperlink

An Early, Nerdy Look At The Problem System

LEAVE A REPLY Cancel reply

EDITOR PICKS

To have stood in it as umpire was God’s reward to me: SK Bansal on 2001 Eden Gardens epic

Jrue Vacation credit teammates, religion, and power after Path Blazers’ win over Clippers

I Am Stepping Away”: Tiger Woods Takes Break To “Search Remedy

POPULAR POSTS

To have stood in it as umpire was God’s reward to me: SK Bansal on 2001 Eden Gardens epic

Jrue Vacation credit teammates, religion, and power after Path Blazers’ win over Clippers

I Am Stepping Away”: Tiger Woods Takes Break To “Search Remedy

ABOUT US