Bases and Runs | Syracusefan.com

Bases and Runs

SWC75

Bored Historian
Joined
Aug 26, 2011
Messages
33,973
Like
65,531
I’ve always been someone who likes to look at the statistical record before making conclusions about things in sports. Statistics don’t end arguments but they put them on an objective basis: even if you are going to conclude something counter to them you have to know what the stats are so you can understand what they are telling you and construct an argument around them.

Traditional baseball numbers, as revered as they are, have been poor measures of how productive a player actually is. A batting average is not an average: it is a percentage of times a player gets a hit in at bats where he doesn’t walk, sacrifice, get hit by a pitch or get on base due to an error. Why not make it a percentage of all plate appearances: if he got on base due to an error, why should that be different- to him- than an out? There are players with high batting averages with no power who don’t walk and thus neither drive in or score a lot of runs, (Rod Carew, who drove in and scored 100 runs the year he hit .388 but did neither in any other year). There are home run hitters with mediocre batting averages but who walk a lot and they both score and drive in a lot of runs, (Harmon Killebrew, who never batted .300 but drove or scored 100 runs 11 times). There are home run hitters who struck out a lot, didn’t walk all that much and had low batting averages. They might hit a home run once a week but what about the rest of the time? There are base stealers like Luis Aparicio, who led the league 9 times in a row but never scored 100 runs because he hit .300 only once, rarely walked and had mediocre power. Eddie Yost stole 72 bases in his career, never hit .300, had a little more power but not a lot but scored 100 runs 5 times because he lead the league in walks six times.

It’s clear that baseball stats were in need of revision to refocus them on the goals of scoring runs and winning games. That’s why all the modern baseball stats that have come out in recent years were developed.

But I also think sports is supposed to be fun: we aren’t trying to put a man on the moon, just get an idea of why one team won, the other lost and how the players contributed to the result. I think numbers should be easy to compute and it should be clear to the average fan what they represent. “Earned Run Average” should be as complicated as it gets. Take the number of runs the official scorer feels should be blamed on the pitcher, divide by the number of innings pitched and multiply by 9 to determine how many runs he’d give up, on average, over nine innings that are his fault. I’m not into the fancy new stats that have come out in recent years with the weird names, such as “Total Average”, (an oxymoron), “Offensive Won-Lost Percentage”, “Super Linear Weights“, “Value Over Replacement” and “Win Shares”, among many others.

Bill James came up with a stat, “Runs Created” to determine how batting and base running contribute to the scoring of runs. His initial formula was: (hits + walks) x (total bases) divided by (at bats + walks). One problem I have with that is that hits, which are part of total bases, are counted twice. And why ‘times’ total bases? But he then factored in steals: (hits + walks - caught stealing) x (total bases + steals) divided by (at bats + walks). Eventually he expanded that to what he called his “technical version”: (hits + walks + hit by pitch - caught stealing - grounded into double play) x (total bases + 26% of (walks - minus intentional walks plus hit by pitch) + 52% of (sacrifice hits + sacrifice flies + stolen bases) decided by (at bats + walks + hit by pitch n+ sacrifice hits + sacrifice flies). Then he expanded that in 13 different “technical” formulas for different eras of baseball to acknowledge statistical gaps and differences in three eras. James goes to great lengths to prove that these formulas can predict runs actually scored but never explains why it is preferable to predict them rather than that to look them up and see how many runs were actually scored, as in “runs produced”: (runs + runs batted in) - home runs, (so they don’t get counted twice). What did happen means more that what should have happened. You’re trying to0 break down actual games, not create fantasy games.

Later when he came us with his revolutionary new system for evaluating players, Win Shares, Bill had to write an entire book to explain it, 85 pages of which are devoted to the formula he uses for it. A brief idea of it is that you divide up credit for the teams wins among it’s players by this statistical formula. Actually, you don’t divide up credit for the actual number of wins. You divide up credit for three times the number of wins. I won’t bother going beyond that.

One of the simpler formulas that have become popular these days is “On Base Plus Slugging”, or OPS, sometimes simply called “Production“. You add a player’s On Base Percentage to his Slugging Percentage and you’ve measured how often he hits, how hard he hit’s the ball and how many times he gets on base. Like that spaghetti sauce, whatever you are looking for “it’s in there”. First you compute On Base Percentage: (hits + walks + hit by pitch) divided by (official at bats plus walks plus hit by pitch). Then you compute slugging percentage: total bases divided by official at bats. “Official” at bats are total plate appearances minus walks, hit by pitch and sacrifices. Total bases are hits expanded to give one base for a single, two for a double, three for a triple and four for a home run. Then you add the two percentages together to get a total” production” stat derived from some traditional numbers.

But you are adding percentages together, something that might make sense if the total opportunities, (divisors) were the same on both sides of the computation, which they aren’t here. And, again, hits are being double-counted in both the number of times on base and total bases. The resulting number is just that, a number: In 1961, Roger Maris had an OPS of .997 and Mickey Mantle had an OPS 1.138. But what does that mean? Mickey Mantle produced 1.138 something per every at bat? But is it per every ‘official’ at bat or all plate appearances or somewhere in between? And what is the “something“? Is it bases? When hits are counted twice? Should we count being hit by a pitch, when a batter isn’t trying to do that?

I think the average fan, (like me) gets lost in all this. Sports stats should be easily and logically computed, produce a number the meaning of which is clear and have a title which meets the same standard. Ideally a fan should be able to see a play on the field and be able to figure in his head how that changes the number. Averages are important in evaluating a player’s capabilities but gross numbers measure his actual achievements. Games are won by the plays you actually did make, not by projected accomplishments. If you are comparing starters with reserves, averages and percentages can be useful, but in ranking top players, I feel that gross numbers are better. They all start and usually play the whole game. You could still use averages to compare a player who missed a lot of games with one who played the whole season but what was the impact of the injured player on the games he didn’t play? Zero. That’s not reflected in the averages. It is in the gross figures.

There’s a lot of emphasis on isolating players from their teammates so that they can be measured as individuals. Runs scored and driven in are seen has being too much influenced by teammates even though they are obviously “bottom-line” figures: what you are trying to accomplish so you can win the game. So the new formulas emphasize “above the line” figures that are only important insofar as they contribute to the scoring of runs on the assumption that the above the line figures- hits, power, walks, steals, etc. are not or less affected by your teammates. I don’t buy it. Everything you do is affected by your teammates. If you have good teammates that create more scoring situations, drive you in more, and see to it that you have to be pitched to and have pitches to steal bases on, they are allowing you to fully display your skills and accomplish more. And when you watch games, it becomes obvious that it’s not just what you do but when you do it that counts.

With all this in mind, I thought about coming up with a simple way of ranking the offensive production of the best players in baseball. I prefer “runs produced” to “runs created” for evaluating actual scoring. As to base production, (which is what most of the modern formulas are really about), I tried to boil down OPS to something that made more sense. Take total bases, (one base for a single, two for a double, three for a triple and four for a home run), add walks and stolen bases.

In 1961, Roger Maris had 159 hits, which included 16 doubles, 4 triples and 61 home runs for 366 “total bases“, (really total hitting bases). He also walked 94 times. He didn’t steal a base. By the latest stats on baseballreferecne.com he had 141 RBIs, (not, as been historically listed, 142), and 132 runs scored. He produced (132 + 141 - 61 =) 212 runs and (366 + 94 +0 =) 460 bases. If you like averages, to keep to my theme of simplicity, I’d suggest simply averaging the numbers per game played, as the leagues top players will tend to be starters and play most or all of every game. Roger played 161 games in 1961 (and hit those 61 homers). He averaged 2.86 bases and 1.32 runs per game.

In 1961, Mickey Mantle had 163 hits, which included 16 doubles, 6 triples, 54 home runs for 353 total bases. He walked 126 times and stole 12 bases. He scored 131 runs, (not 132, as previous sources list) and drove in 128. He produced (131 + 128 - 54 =) 205 runs and (353 + 126 + 12 =) 491 bases. Mickey played 153 games and averaged 3.21 bases per game and 1.34 runs per game.
I think that’s easier to understand and more meaningful than Maris has a OPS of .997 and Mantle is at 1.138, or that Roger created, (by the simplest formula) 135 runs compared to 159 for Mickey, (when Roger produced 212 to Mickey’s 205), or that Roger had 36 “win shares“ and Mickey 48. It’s also more meaningful than Maris batted .269 and Mantle .317 or even that Maris hit 61 home runs to Mantle’s 54. They were comparable players that year, but Mickey was a little better.. Mantle produced more bases because he walked more and stole bases but Maris was the slightly better run producer. Incidentally, Maris batted third and Mantle 4th, so Mickey often drove Roger in but Roger didn’t drive Mickey in. One wonders if they might have been even more productive if they’d switched positions in the order, since Mickey got on base more.. But they were in a potent line-up so they both had plenty of opportunities to make good on their hitting and base running skills.

Again, numbers don’t answer all the questions you’d want to ask in ranking players and these numbers are only about tangible offensive contributions, not about defense, leadership, “the little things that show up in the box score“, etc. But I think everyone should know about them as they think and talk about players. And the numbers every kid knows about his favorite player shouldn’t be his batting average or even how many home runs he hit but how many bases has he produced and how many runs has he produced. It’s simple and meaningful. And I’m sure those enamored of more complicated stats can make a case for their stats being more precise measurements of a player’s abilities but I doubt their rankings of players would be that much different, for all the extra work. And a fan can watch a game and see a player hit a double, drive in a run, steal a base and score on a sac fly and realize that that player just produced three more bases and two more runs to add to his total in the morning paper. He wouldn’t need a calculator for that.

With that as a background, I thought I’d make a monthly post of the top ten in each league in bases and runs produced. Again there’s no need to ignore any other numbers, but you might want to have a look at these, especially at the end of the year when people start arguing about who should win awards.

National League

Bases Produced

Joey Votto CIN 232 in 76 games = 3.05 per game
Carlos Gonzalez COL 210 in 70 games = 3.00 per game
David Wright, NY 209 in 74 games = 2.82 per game
Andrew McCutchen PIT 204 in 73 games = 2.79 per game
Ryan Braun MIL 203 in 71 games = 2.86 per game
Carlos Beltran STL 199 in 74 games = 2.69 per game
Melky Cabrera SF 193 in 75 games = 2.57 per game
Hunter Pence PHI 186 in 78 games = 2.38 per game
Matt Holiday STL 184 in 74 games = 2.49 per game
Giancarlo Stanton MIA 181 in 75 games = 2.41 per game

Runs Produced

Carlos Gonzalez COL 99 in 70 games = 1.41 per game
David Wright NY 92 in 74 games = 1.24 per game
Carlos Beltran STL 88 in 74 games = 1.19 per game
Hunter Pence PHI 87 in 78 games = 1.12 per game
Melky Cabrera SF 84 in 75 games = 1.12 per game
Matt Holiday STL 83 in 74 games = 1.12 per game
Joey Votto CIN 83 in 76 games = 1.09 per game
Andrew McCutchen PIT 82 in 73 games = 1.12 per game
Dan Uggla ATL 82 in 75 games = 1.09 per game
Andre Ethier LA 81 in 75 games = 1.08 per game

American League

Bases Produced

David Ortiz BOS 214 in 76 games = 2.82 per game
Josh Hamilton TEX 212 in 71 games = 2.99 per game
Jose Bautista TOR 209 in 77 games = 2.71 per game
Adam Dunn CHI 202 in 77 games = 2.62 per game
Curtis Granderson NY 200 in 76 games = 2.63 per game
Robinson Cano NY 199 in 76 games = 2.62 per game
Edwin Encarnacion TOR 198 in 74 games = 2.68 per game
Miquel Cabrera DET 197 in 77 games = 2.56 per game
Adam Jones BAL 192 in 76 games = 2.53 per game
Ian Kinsler TEX 191 in 76 games = 2.51 per game

Runs Produced

Josh Hamilton TEX 95 in 71 games = 1.34 per game
Miquel Cabrera DET 91 in 77 games = 1.18 per game
Jose Bautista TOR 90 in 77 games = 1.17 per game
David Ortiz BOS 89 in 76 games = 1.17 per game
Ian Kinsler TEX 88 in 76 games = 1.16 per game
Nelson Cruz TEX 84 in 76 games = 1.11 per game
Adrian Beltre TEX 82 in 74 games = 1.11 per game
Edwin Encarnacion TOR 82 in 74 games = 1.11 per game
Jason Kipnis CLE 81 in 75 games = 1.08 per game
Elvis Andrus TEX 80 in 75 games = 1.07 per game
Prince Fielder DET 80 in 77 games = 1.04 per game

 
 
What did happen means more that what should have happened. You’re trying to0 break down actual games, not create fantasy games.

This isn't necessarily true. The objective of sabermetrics is to separate skill and luck of past performances and use the 'skill' part of the equation to predict likely future outcomes. Explaining why games ended the way they did is easy enough that you don't need any advanced stats - ie the stats were equal but Team A hit better with guys in scoring position. James would argue that 'timely hitting' is largely a result of luck and it's pretty clear he's right about that after the 100+ years of evidence we have.
 
This isn't necessarily true. The objective of sabermetrics is to separate skill and luck of past performances and use the 'skill' part of the equation to predict likely future outcomes. Explaining why games ended the way they did is easy enough that you don't need any advanced stats - ie the stats were equal but Team A hit better with guys in scoring position. James would argue that 'timely hitting' is largely a result of luck and it's pretty clear he's right about that after the 100+ years of evidence we have.

I'm not trying to predict future outcomes. I'm trying to analyize what has actually happened. And all you have to do is watch a game to see what when you do something is as important as what you do.
 
I think we're kinda saying the same thing. I'm just saying James' stuff isn't really meant for the postgame analyst or casual fan. But a lot of his stuff is useful for say, GM's evaluating FA's; eg: this player is coming off a career year but his bapip indicates he was really lucky last yr so he may be overvalued...Of course some personnel guys like Rueben Amaro Jr. are idiots and chose to ignore this stuff, but that's neither here nor there.

Fumble recoveries have a big impact on the outcome of football games but it's also been proven that they're basically entirely random; so it's not really something worth discussing in the aftermath of a game.
 
I think we're kinda saying the same thing. I'm just saying James' stuff isn't really meant for the postgame analyst or casual fan. But a lot of his stuff is useful for say, GM's evaluating FA's; eg: this player is coming off a career year but his bapip indicates he was really lucky last yr so he may be overvalued...Of course some personnel guys like Rueben Amaro Jr. are idiots and chose to ignore this stuff, but that's neither here nor there.

Fumble recoveries have a big impact on the outcome of football games but it's also been proven that they're basically entirely random; so it's not really something worth discussing in the aftermath of a game.


I can see the utlility of James work to a GM: he's tryign to project stuff. I'm just hadding up what has happened, which might be more useful in detemrineg all-stars or MVP awards. GMs are thinking of the future. I'm not.
 
This isn't necessarily true. The objective of sabermetrics is to separate skill and luck of past performances and use the 'skill' part of the equation to predict likely future outcomes.
The issue I take with the Sabermetricians is that their interpretation of the statistics is now being used to measure past outcome. Since their whole thing is divorcing context, that makes zero sense.
 
The issue I take with the Sabermetricians is that their interpretation of the statistics is now being used to measure past outcome. Since their whole thing is divorcing context, that makes zero sense.

I think Fangraphs uses their FIP and xFIP stuff for their pitcher WAR, which doesn't strike me as the best idea. Maybe a guy got lucky and had a low BABIP and that is why he had such a good year, and if I'm signing him to a contract for the next 3 years I want to know that, but if I'm deciding who provided more value to their team last year then I don't think that is as important.
 
I think Fangraphs uses their FIP and xFIP stuff for their pitcher WAR, which doesn't strike me as the best idea. Maybe a guy got lucky and had a low BABIP and that is why he had such a good year, and if I'm signing him to a contract for the next 3 years I want to know that, but if I'm deciding who provided more value to their team last year then I don't think that is as important.
Yup. Don't even get me started on WAR. That among all of them is the dumbest statistic out there.
 
Yup. Don't even get me started on WAR. That among all of them is the dumbest statistic out there.

I think WAR for position player is pretty good, though you still run into problems with defensive stats.
 
I think WAR for position player is pretty good, though you still run into problems with defensive stats.
The problem I have with it is that it's basically trying to turn the game into strat-o-matic baseball. The WAR formula doesn't include anything that actually relates to real game outcomes, i.e. winning games. It's all just things that should hypothetically lead to wins, with a scrub as your measuring stick.

When I bring this up, the argument I get is that it works because it isolates the player from the context, but I just don't understand why you'd do that. The context is the whole thing.
 
The problem I have with it is that it's basically trying to turn the game into strat-o-matic baseball. The WAR formula doesn't include anything that actually relates to real game outcomes, i.e. winning games. It's all just things that should hypothetically lead to wins, with a scrub as your measuring stick.

When I bring this up, the argument I get is that it works because it isolates the player from the context, but I just don't understand why you'd do that. The context is the whole thing.


I don't think you can isolate a player in a team sport. Other sports don't even worry about it. If Peyton Manning throws a touchdown pass to marvin Harrison, did that happen because Peyton Manning is great, because Marvin Harrison was great or because his offensive line was great? Maybe the defesne was lousy. They just add it up and see who threw and caught the most TD passes. If a player is in an ideal situaiton, it allows him to play to his maximum potential. Donovan McNabb may not have done as well because he had no Harrison, (the one year he had an interested Owens he did do as well), but that's just part of the analysis. It's not a statistic.
 
The context is the whole thing.
I appreciate your point, but so much success and failure in baseball is determined by chance. The difference between an RBI base hit and a double play is often inches. Is the guy that happened to hit it just to the fielder's left better than the guy that hit it right at the fielder or was it just due to chance? Of course in the fans' and media's eyes, the guy who got the hit is a clutch performer and the guy who grounded into the double play is a choker. A stat that can compare players by eliminating the random nature of the game is valuable in my opinion. Whether any stats successfully accomplish this is another question.
 
Yeah, the more I watch games the more I realize how random some of the stuff is. Guy hits the ball on the screws; but he hits it right at a guy; he's out. Guy hits a blooper, but it's placed well; that's a hit.

That doesn't really have anything to do with WAR or advanced stats or anything.
 
That doesn't really have anything to do with WAR or advanced stats or anything.
Off the top of my head I know that "FIP" only measures those things that a Pitcher can control (i.e., strikeouts, walks, and home runs) and therefore limits some parts of a pitching performance that are somewhat random. The number of batted balls that drop in for hits can vary significantly from year to year for a particular pitcher. Of course there are obviously factors that can influence home runs and strikeouts (e.g., ballpark, umpire, wind). I think there is a xFIP, which changes some of these elements to league averages.
 
Off the top of my head I know that "FIP" only measures those things that a Pitcher can control (i.e., strikeouts, walks, and home runs) and therefore limits some parts of a pitching performance that are somewhat random. The number of batted balls that drop in for hits can vary significantly from year to year for a particular pitcher. Of course there are obviously factors that can influence home runs and strikeouts (e.g., ballpark, umpire, wind). I think there is a xFIP, which changes some of these elements to league averages.

Yeah i think xFIP basically normalizes the HR rate based off the pitchers ground ball rate.
 
I appreciate your point, but so much success and failure in baseball is determined by chance. The difference between an RBI base hit and a double play is often inches. Is the guy that happened to hit it just to the fielder's left better than the guy that hit it right at the fielder or was it just due to chance? Of course in the fans' and media's eyes, the guy who got the hit is a clutch performer and the guy who grounded into the double play is a choker. A stat that can compare players by eliminating the random nature of the game is valuable in my opinion. Whether any stats successfully accomplish this is another question.
I guess I just don't see the need to eliminate the random nature, especially when evaluating events that have already taken place. I'm working under the assumption that the game is equally random for all players.
 
I'm working under the assumption that the game is equally random for all players.
Over a large sample size, it likely is, but based on data at fangraphs.com a season might not be a large enough sample size. A pitcher might have nearly identical xFIPs from season to season, but have dramatically different ERAs. This suggests (certainly doesn't prove) that a perceived good year VS. a bad year may be somewhat due to chance i.e., some years they hit them at your fielders and some years they don't.
 
^^ Yah I think I remember that being the case for Cole Hamels and his one down year in '09. Like just an absurd % of his fly balls became HR's.
 

Forum statistics

Threads
170,338
Messages
4,885,499
Members
5,992
Latest member
meierscreek

Online statistics

Members online
218
Guests online
1,234
Total visitors
1,452


...
Top Bottom