Stuff+: The Future of Pitching Analytics
A Deep Dive into Baseball's Most Advanced Pitching Metric
8/9/20258 min read
Stuff+: The Future of Pitching Analytics - A Deep Dive into Baseball's Most Advanced Pitching Metric
In the ever-evolving world of baseball analytics, few metrics have captured the attention of front offices, scouts, and fans quite like Stuff+. This sophisticated pitching metric represents the cutting edge of how we evaluate what makes a pitcher truly effective. But what exactly is Stuff+, how does it work, and why has it become such a game-changer in modern baseball evaluation?
The Evolution from "Stuff" to Stuff+
The concept of measuring a pitcher's "stuff" isn't new to baseball. For decades, scouts and coaches have used the term to describe the raw quality of a pitcher's arsenal—the nastiness of their slider, the life on their fastball, or the devastating movement of their curveball. However, quantifying this elusive quality proved challenging until the analytics revolution began to take hold.
The journey toward modern stuff metrics began in the mid-2010s when researchers in the FanGraphs community started developing operational definitions of "stuff." These early models aimed to represent "a three-dimensional shape, where the three axes of the shape represent a pitcher's peak velocity, a pitcher's change in velocity between their fastest and slowest pitch, and the amount of distance that their pitches can break."
The current Stuff+ model, developed by Eno Sarris and Max Bay with inspiration from work by Ethan Moore, Harry Pavlidis, and Jeremy Greenhouse, represents a significant advancement over these earlier attempts. Unlike its predecessors, Stuff+ uses machine learning algorithms and vast amounts of Statcast data to create a more sophisticated and accurate measure of pitch quality.
Understanding the Three-Model System
Stuff+ doesn't operate in isolation—it's part of a comprehensive three-model system that includes Location+ and Pitching+. Each component serves a distinct purpose in evaluating pitcher performance.
Stuff+: The Raw Material
Stuff+ looks only at the physical characteristics of a pitch, including but not limited to: release point, velocity, vertical and horizontal movement, and spin rate. A pitcher's secondary pitches are defined based on their primary fastball—with "primary" defined by usage in an outing—and so are judged by velocity and movement differentials along with raw velocity and movement numbers.
The model captures what many consider the "nastiest" pitches in baseball. Stuff+ was trained against run values, so even if the research community is divided about how much a pitcher can control weak contact, the model includes an inherent nod to the possibility that they do possess some of that ability.
Interestingly, the importance of release point in the model also suggests that Stuff+ includes some deception—you'll find some pitchers with unique release point and movement combinations score very well despite lower velocities.
Location+: Command and Control
While stuff gets much of the attention, location is equally crucial to pitching success. Location+ is a count- and pitch type-adjusted judge of a pitcher's ability to put pitches in the right place. It ignores a pitch's physical characteristics and looks at count, pitch type and location.
On any given pitch, the location is hugely important, more than the stuff. But stuff is stickier season to season and start to start, so it's a safer bet. This distinction is crucial for understanding why teams might value certain types of pitchers differently in free agency or trade scenarios.
Pitching+: The Complete Picture
The overall model, Pitching+, is not just a weighted average of Stuff+ and Location+ across a pitcher's arsenal. Rather, it is a third model that uses the physical characteristics, location, and count of each pitch to try to judge the overall quality of the pitcher's process. Batter handedness is also included in Pitching+, capturing platoon splits on pitch movements and locations.
This comprehensive approach makes Pitching+ particularly valuable for evaluating a pitcher's overall effectiveness, as it captures the interaction between raw stuff and command that often determines success at the major league level.
The Technical Foundation
The sophisticated nature of Stuff+ lies in its methodology. Generally, the model aims to capture the "nastiest" pitches in baseball, using a decision tree-based model to capture the nonlinear relationships that exist across release points, velocities, pitch movement, and more.
All three metrics use the familiar "+" scale (like wRC+), with 100 being average. This standardization makes it easy for analysts and fans to understand: a pitcher with a 120 Stuff+ has stuff that's 20% above average, while someone with an 85 Location+ has below-average command.
The models incorporate cutting-edge biomechanical concepts as well. Vertical attack angle is not explicitly in the model, but it is captured by the interaction between release points and movement. This nuanced approach allows the metric to capture subtle but important aspects of pitch effectiveness that might be missed by simpler models.
Practical Applications in Modern Baseball
The applications of Stuff+ extend far beyond academic interest. Teams across Major League Baseball have integrated these metrics into their decision-making processes at multiple levels.
Player Evaluation and Acquisition
The free agent market has been paying more for stuff than location recently, reflecting an industry-wide recognition of stuff's predictive value. This trend has significant implications for how teams approach roster construction, potentially leading to a preference for high-stuff pitchers who can be taught command over polished but stuff-limited hurlers.
The metrics are particularly valuable in amateur and international scouting. Young pitchers often lack refined command but may possess exceptional raw stuff. Stuff+ provides a framework for evaluating these prospects' ceilings in a way that traditional scouting alone cannot match.
Development and Coaching
Perhaps most importantly, these metrics provide actionable insights for player development. Coaches can identify specific areas where a pitcher's stuff might be lacking—whether it's release point consistency, spin rate optimization, or movement profiles—and work systematically to address these deficiencies.
The Stuff Metric has even shown potential as an injury identification tool, with researchers noting instances where declining stuff scores preceded actual injury diagnoses. This predictive capability could revolutionize how teams manage pitcher health and workload.
In-Game Strategy
Real-time applications of stuff metrics are becoming increasingly sophisticated. Teams can now evaluate how a pitcher's stuff changes throughout a game, informing decisions about when to remove starters or which relievers to deploy in specific situations.
The Reliability Factor
One of Stuff+'s greatest strengths is its reliability in small samples. Stuff+ becomes reliable 80 pitches into the season and is extremely powerful relative to any other single stat in the tiniest of samples, while Location+ takes something more like 400 pitches to reach a similar level of stability.
This rapid stabilization makes Stuff+ particularly valuable for evaluating young pitchers, trade acquisitions, or identifying breakout candidates early in the season. Pitching+ also predicts rest-of-season results better than K-BB% in smaller samples.
The predictive power extends beyond immediate results. Similar to the graphs above, here we're measuring Cronbach's Alpha between the metric at the given PA number and the measure at the end of the year: Pitching+ also predicts rest-of-season results better than K-BB% in smaller samples.
Competing Models and Approaches
Stuff+ isn't the only game in town. FanGraphs now features two pitch quality models: PitchingBot and Stuff+. PitchingBot takes inputs such as pitcher handedness, batter handedness, strike zone height, count, velocity, spin rate, movement, release point, extension, and location to determine the quality of a pitch, as well as its possible outcomes.
Independent researchers have also developed their own stuff models, with some focusing on incorporating advanced concepts like Seam-Shifted Wake (SSW) to better understand visual deception. This diversity of approaches reflects the dynamic and evolving nature of pitch modeling.
The competition between different models drives innovation and helps validate findings across different methodological approaches. When multiple independent models reach similar conclusions about a pitcher's quality, confidence in those assessments increases significantly.
Limitations and Criticisms
Despite its sophistication, Stuff+ is not without limitations. Like all analytics, it must be understood within the broader context of baseball evaluation.
The Human Element
One prevalent critique is that a too-heavy reliance on statistics may overlook the intangible elements of baseball, such as teamwork, morale, and leadership, which are difficult to quantify. Measuring the Mental Game Despite our efforts, we can't know the truth of what goes on in players' heads, but it's clear some guys are better at the mental game than others.
Stuff+ captures the physical aspects of pitching but cannot measure a pitcher's competitiveness, ability to rise to the occasion, or capacity to make adjustments during an at-bat. These "intangible" qualities often separate good pitchers from great ones.
Context Dependencies
The longer a pitcher is in the big leagues, the more their actual results matter when weighed against their Pitching+ numbers. This suggests that while stuff metrics are excellent for identifying talent and predicting breakouts, experience and adaptation play increasingly important roles as careers progress.
The metrics also cannot fully account for the dynamic nature of pitcher-batter interactions. A pitcher might have excellent stuff on paper but struggle against certain types of hitters or in specific game situations.
Sample Size Considerations
While Stuff+ stabilizes quickly, Location+ requires significantly more data to reach reliability, potentially limiting its usefulness for in-season evaluation of pitchers with limited appearances. This creates an imbalance in how we can evaluate different aspects of pitching performance.
Technological Limitations
The quality of Stuff+ is inherently limited by the accuracy and consistency of the underlying Statcast data. Measurement errors, ballpark effects, and equipment calibration can all introduce noise into the system that may affect individual pitcher evaluations.
Real-World Case Studies
Recent research has highlighted both the power and limitations of Stuff+ through specific player examples. Andrew Heaney, despite having "below-average velocity, extension, and induced vertical break but above-average horizontal break and spin rate," generates better than average whiff rates and expected outcomes despite having only average Stuff+.
Conversely, Julian Merryweather has "better than average velocity, extension, induced vertical break, and spin" with "well above-average Stuff+ (almost 30% above an average pitch)" but produces "below-par whiff percentages and xwOBA."
These examples illustrate that while Stuff+ is highly predictive, it doesn't tell the complete story for every pitcher. The interaction between stuff, command, sequencing, and situational factors creates a complex web that no single metric can fully capture.
The Future of Pitch Modeling
The evolution of stuff metrics shows no signs of slowing. Several exciting developments are on the horizon:
Advanced Biomechanics Integration
Future models are likely to incorporate more sophisticated biomechanical data, including detailed analysis of Seam-Shifted Wake effects and their impact on visual deception. This could help explain some of the current disconnects between predicted and actual performance.
Machine Learning Advances
As machine learning techniques continue to evolve, we can expect more sophisticated models that can capture non-linear relationships and interaction effects that current models might miss. Deep learning approaches might eventually identify patterns in pitch effectiveness that human analysts haven't even considered.
Real-Time Applications
The development of real-time stuff tracking could revolutionize in-game strategy. Imagine being able to see a pitcher's stuff grade update with each pitch, providing immediate feedback on fatigue, effectiveness, and optimal usage patterns.
Integration with Other Data Sources
Future models will likely integrate biomechanical data from wearable sensors, high-speed video analysis, and even physiological markers to create more complete pictures of pitcher performance and health.
Conclusion: The Analytics Evolution Continues
Stuff+ represents a remarkable achievement in baseball analytics—a metric that successfully quantifies one of the sport's most subjective concepts. Being able to judge a pitchers' ability to throw good shapes and velocities to the right locations should also have separate value to those trying to evaluate hurlers because of how quickly those shapes, velocities, and locations become meaningful.
However, the story of Stuff+ is really the story of baseball analytics itself: a continuous process of asking better questions, developing more sophisticated tools, and gradually deepening our understanding of this complex and beautiful game. As analytics writer Eno Sarris notes, "I think (the data) just gives you a sense of context and a sense of where this belongs in history and I think that adds a story to tell. It doesn't detract. It's part of telling the whole story."
The future will undoubtedly bring even more advanced metrics and modeling techniques. But Stuff+ has established itself as a foundational tool in modern baseball evaluation—one that has already changed how teams acquire, develop, and deploy pitching talent. For analysts, scouts, and fans alike, understanding Stuff+ provides a window into the cutting edge of baseball's ongoing analytical revolution.
As one baseball analytics expert noted, "Above all, baseball is a game of chance. No matter how many metrics we devise, it's unlikely we will ever be able to nail it down to a point, but isn't that part of what makes it so much fun?" Stuff+ doesn't take away from baseball's unpredictability—it simply gives us better tools to understand and appreciate the incredible skill required to succeed at the game's highest level.
Whether you're a front office executive evaluating trade targets, a coach working with young pitchers, or a fan trying to understand why your team's new acquisition is generating so much excitement, Stuff+ provides unprecedented insight into what makes pitching truly elite. In a sport where the difference between success and failure often comes down to mere inches and milliseconds, having the best possible tools for evaluation isn't just helpful—it's essential.
For the latest Stuff+ data and analysis, visit FanGraphs.com. The metric continues to evolve as new data sources and modeling techniques become available, ensuring that our understanding of pitching effectiveness will only grow more sophisticated with time.