Sunday Afternoon Fever
DISCO: A metric and analysis of DISguised COverages on Defense
Introduction
Game theory and deception are components to almost every single play in football. In addition to great talent, football fans love to watch how head coaches, coordinators, and players are constantly trying to outsmart their opponents. One of many opportunities for teams to leverage deception is with defensive pre-snap alignments, when a defense’s goal is to confuse the offense as to what pass coverage they will run. Perhaps they are disguising a coverage, or they want the offense to believe a disguised coverage is coming when it is not. Thanks to the NFL and Next Gen Stats, we propose a metric that evaluates the frequency each team’s defense disguises their coverage in the moments before the ball is snapped: DISCO
DISCO: DISguised COverage %:
- The percent of plays a team’s defense transforms to an unlikely pass coverage given their pre-snap alignment and movements
-
Unlikely is defined as a coverage that is observed (across NFL) less than 10% of plays given the pre-snap alignment
-
Note: This metric can also be broken down by pre-snap alignment type
Approach for Developing DISCO
Our approach involves leveraging an unsupervised machine learning model, K-Medoids, to separate pre-snap alignments and movements into clusters, and then to analyze the resulting clusters to learn about what they entail post-snap. Considering the behaviors of defensive backs pre-snap provide the best indications into what coverage the defense may run, we found that linemen and linebacker data added noise to our model that did not enhance our output and analysis.
We capture the location and orientation metrics at two seconds before the snap, and motion metrics throughout the pre-snap period. We found that defenses are more likely to reveal their alignment as time passes from the huddle break, yet we need to use a time early enough that the metric can be relevant to a QB/offense who has a moment to process what is observed.
We considered the following metrics for the model (as if play direction is horizontal, left to right):
- Deepest safety/DB horizontal distance from ball
- Deepest safety/DB vertical distance/direction from ball
- 2nd deepest safety/DB horizontal distance from ball
- 2nd deepest safety/DB vertical distance/direction from ball
- Two deepest backs vertical distance/direction from center of field
- All DB direction of pre-snap motion
- Two deepest safeties/DB’s angle of body orientation
- For each CB
- Horizontal distance from ball
- Vertical distance/direction from ball
- Horizontal distance from nearest receiver
- Vertical distance/direction from nearest receiver
- Angle of body orientation
- For offensive plays using a pre-snap motion
- Defender motion relative to receiver motion
Potentially relevant metrics not included:
- Verbal signals
- Hand gestures/pointing
- Direction player is leaning
- Direction a player is looking (independent of body orientation)
- Game situation (e.g. - down, yards, score, previous play(s)
- Team historical alignments
The objective of the K-Medoids model is to identify data points among a cluster of similar data (data within proximity) while differentiating data points that are far apart. For example, just as a QB can often spot the difference between a Cover 1 and Cover 3 based on the distance defensive backs are from receivers, response to motion and perhaps their body orientation, even if both have a similar location of a middle deep safety, the cluster algorithm can recognize those alignments as distinct. The purpose of identifying unique clusters in an unsupervised model is not to predict the defenses coverage, but to further explore post-snap coverages as a function of pre-snap alignments.
Cluster Results
Theoretically, pre-snap alignments can be grouped into any number of clusters. Our model found that 4 was the fewest distinct groupings to signal meaningful and insightful differences, yet few enough to be practical and to not overfit clusters. As we see in the below chart, defenses can and have operated in any coverage from any pre-snap alignment cluster, however, we do find unique patterns across clusters.
Note: We are just showing the most common pass coverages, and all types of coverage within the same category are grouped together. E.g. – Cover 3 Cloud Left, Cover 3 Cloud Right
Cover 0 | Cover 1 | Cover 2 | Cover 3 | Quarters | Cover 6 | |
---|---|---|---|---|---|---|
Pre Snap Cluster-1 | 4% | 14% | 9% | 50% | 15% | 7% |
Pre Snap Cluster-2 | 7% | 53% | 8% | 30% | 2% | 1% |
Pre Snap Cluster-3 | 2% | 19% | 19% | 28% | 17% | 15% |
Pre Snap Cluster-4 | 1% | 20% | 23% | 26% | 15% | 16% |
Now that we have a sense of the frequency defenses operate different pass coverages from each cluster, we can explore how several of our model input metrics average out by Cluster.
Note:
- Metrics are adjusted as if all play directions are left to right
- 0 degrees is facing directly at the line of scrimmage to the left.
There are several takeaways but a few to note are:
- Clusters 1 and 2 tend to have the deepest DB aligned close to the center of the field
- Clusters 3 and 4 on average see the two deepest DB’s aligned 5-6 yards off the center on each side
- Cluster 2’s greatest distinction is having the combination of DB’s closest to the WR’s while the deepest DB is aligned in the center
- Clusters 3 and 4 have average separation between DB’s and WR’s greater than 5 yards
- A key distinction between Clusters 3 and 4 is the opposite Y-direction from the ball between the two deepest backs
- The average orientation of DB’s is lowest for Cluster 1
Metric | Cluster 1 | Cluster 2 | Cluster 3 | Cluster 4 |
---|---|---|---|---|
Farthest DB X Yards From Football | 12.52 | 12.97 | 13.53 | 13.82 |
Farthest DB Y Yards From Football | 0.59 | 0.29 | -6.78 | 5.98 |
2nd Farthest DB X Yards From Football | 8.42 | 5.75 | 11.05 | 11.02 |
2nd Fathest DB Y Yards From Ball | 1.08 | -4.86 | 5.93 | -6.20 |
Nearest DB X Yards From WR1 | 5.83 | 2.35 | 3.26 | 2.84 |
Nearest DB Y Yards From WR1 | 6.10 | 4.73 | 6.08 | 5.54 |
Nearest DB X Yards From WR2 | 3.49 | 3.34 | 5.52 | 5.00 |
Nearest DB Y Yards From WR2 | 2.35 | 3.54 | 4.94 | 5.31 |
Nearest DB to WR1 Orientation Degrees From LOS | 8.63 | 29.29 | 20.41 | 26.57 |
2nd Nearest DB to WR2 Orientation Degrees From LOS | 0.83 | 15.14 | 9.20 | 8.65 |
Visualizing the Clusters in Action
With a basic understanding of the differentiators of each cluster, we can now look at a sample play from each, plus one additional example of a disguised coverage from a (lack of) motion.
Cluster 1: Jazz
Like jazz music, this pre-snap alignment allows for a smooth transition into its most common coverage, Cover 3. There is one deep safety, but other DBs are not pressing receivers. In this play, Kyler Murray looks to hit Marquise Brown over the middle just across the first down marker. After Brown bounces off the would-be tackler, he gives the Cardinals more than they bargained for with a touchdown.
Press the play button to watch (time speed is 2x), or watch the play manually by adjusting the time toggle.
Cluster 2: Rock n’ Roll
Just as a rock song can stand out with a great guitar solo, the solo DB stands out in the Cluster 2 alignment. Here, in addition to one deep safety, a pre-snap motion is shadowed by the Saints DB. Tom Brady likes the outside 1-on-1 matchup with his wide receiver, Breshad Perriman, who catches the 4th quarter go-ahead touchdown.
Cluster 3: Classical
This alignment is structured in a methodical way, much like classical music. Compared to Clusters 1 and 2, several coverages are within the norm from this alignment. In this play, the Bears drop into a Cover 2, but there are elements to the alignment that closely resemble other coverages. We can observe two DBs pressed closely to wide receivers and the highest DB near the center of the field until drifting wide in the final two seconds before the snap (our timestamp for measuring an alignment cluster). Ultimately, DB #22 Kindle Vindor drops to coverage and comes up with the interception.
Cluster 4: Electronic
While electronic music is versatile with a variety of tones, beats, and rhythms, this alignment follows suit by being the most versatile to coverage variety. Cover 2, Quarters, and Cover 6 are not represented as the most common coverage for any cluster, but this cluster would be closest for all three. In the below case, the Bengals show two deep backs on each side of the field, are not pressing any receivers until the final second, and drop into Quarters. Tua Tagovailoa hits Tyreek Hill on the center deep ball, nicely finding the gap in coverage.
DISCO with a Pre-Snap Motion
While the previous examples showed a non-DISCO play for each cluster, the following example demonstrates how the Broncos disguised a Cover 1 from Cluster 4. In this case, the Jets motion a receiver across the field. There is some subtle movement from the DBs but considering no defensive player directly followed the receiver in motion and we observe two deep backs on each side of the field (similarly to the previous example), Cluster 4 is the classified alignment. Within a second of the snap, the Broncos’ alignment starts to shift towards Cluster 1. When the play is in action, we see one safety drop back deep middle, and the other defenders immediately find their man assignment.
DISCO Results
After identifying plays that qualify as DISCO, we can analyze the frequency of plays a team’s defense disguises coverage (weeks 1-9 in NFL 2022-23 season), as well as a breakdown of DISCO frequencies relative to each cluster for that team. While the Vikings used DISCO more frequently than any other team, the Jaguars used a Cluster 2 disguise at the highest rate. With further analysis, it may also be determined that some teams are known to prefer more often aligning untraditionally for a given coverage depending on their personnel and other factors.
Note: Plays under two minutes in a half, at the goal-line, or during 3-score games in the second half are excluded from eligibility, as we found DISCO plays are extremely uncommon in those situations.
Team | DISCO | DISCO Cluster 1 | DISCO Cluster 2 | DISCO Cluster 3 | DISCO Cluster 4 |
---|---|---|---|---|---|
25% | 40% | 22% | 4% | 2% | |
24% | 34% | 22% | 6% | 2% | |
23% | 32% | 21% | 13% | 4% | |
22% | 32% | 17% | 9% | 0% | |
21% | 27% | 18% | 7% | 9% | |
21% | 25% | 23% | 7% | 4% | |
21% | 31% | 17% | 9% | 7% | |
19% | 21% | 15% | 21% | 12% | |
19% | 24% | 13% | 6% | 3% | |
18% | 26% | 33% | 0% | 2% | |
18% | 23% | 26% | 0% | 2% | |
18% | 31% | 29% | 7% | 10% | |
17% | 26% | 11% | 4% | 2% | |
17% | 25% | 24% | 6% | 2% | |
16% | 27% | 18% | 2% | 5% | |
15% | 19% | 20% | 4% | 3% | |
15% | 20% | 12% | 11% | 4% | |
14% | 14% | 19% | 9% | 0% | |
14% | 13% | 32% | 4% | 1% | |
13% | 15% | 23% | 4% | 5% | |
13% | 17% | 12% | 0% | 3% | |
13% | 19% | 13% | 0% | 5% | |
12% | 15% | 23% | 0% | 0% | |
12% | 15% | 27% | 0% | 5% | |
12% | 16% | 12% | 3% | 0% | |
12% | 15% | 14% | 4% | 5% | |
11% | 24% | 3% | 1% | 5% | |
11% | 13% | 18% | 2% | 0% | |
11% | 15% | 17% | 1% | 5% | |
10% | 16% | 29% | 4% | 0% | |
10% | 15% | 15% | 1% | 7% | |
6% | 5% | 11% | 4% | 2% |
Lastly, in order to get a sense of how DISCO affects the game, we can plot the average change in expected points on each play, for each cluster. We are making the assumption that a successful defensive play has a negative change in expected points (although there are situational cases where that might not be the case).
We find that for Cluster 1 and 2, DISCO plays have a notable negative edge vs non-DISCO plays. For Clusters 3 and 4, there are less than 100 samples for each, nonetheless, those defensive plays underperformed non-DISCO, perhaps because many of those plays are Cover 0.
Overall, DISCO plays have a change in expected points per play of 0.02 less than non-DISCO plays.
The 3 teams with the best difference in DISCO vs non-DISCO change in expected points per play were:
- Commanders: -0.53
- Jaguars: -0.50
- Saints: -0.44
Conclusion, Limitations, and Future Enhancements
The DISCO metric is not an evaluation of defensive strategy, rather an analysis of defensive strategy. Being unpredictable sometimes means playing the hand right in front of your opponent’s eyes. Factors such as game situation or player personnel could trump the desire to disguise in certain situations. One intention for use of DISCO would be as a supplement to scouting, film, and pre-snap offensive decision making. Another is for common NFL fans to leverage as they watch games, as it can be fun to predict what teams will do on each play.
For future enhancements to DISCO:
- Incorporating the potentially relevant metrics not included, noted above
- Factoring in recent play history, game situation, and team’s history would add more context into team’s alignment decisions
- Leveraging a supervised machine learning algorithm that incorporates predictive behaviors
- Using more than 4 clusters may be relevant for some teams and lead to more precise results
- Incorporating less common pass coverages or more specific coverages
- Looking into blitz disguises (and fake blitzes)
Acknowledgments
Thank you to the NFL analytics community and the hosts of the Big Data Bowl competition for engaging with fans and data enthusiasts in such a cool way! Also, thank you to the data engineers and architects who developed Next Gen Stats and other NFL data, providing easily accessible and usable data to the public.
Appendix
All code is available in this github repo.