Search for a command to run...
Conventional sports anticipation studies primarily rely on hypothesis-testing paradigms that target predetermined cues. However, such approaches risk overlooking unanticipated sources of predictive information. This study addresses this limitation by introducing a data-driven analysis using machine learning (ML) models as a complementary approach to conventional experimental research. Given that predictive cues embedded within movements can enhance the prediction accuracy of ML models, the proposed analysis identified spatiotemporal cues for prediction and quantified the effects of accumulating opponent-specific information across trials. Motion-capture data were collected from eight collegiate baseball pitchers, and joint-angle time series were analyzed using logistic regression models to predict pitch type (fastball vs. breaking ball). Specifically, two analyses were conducted: (1) a sliding time-window analysis to identify when and where predictive cues emerged within target motions and (2) a set-size analysis to evaluate how prediction accuracy varied with dataset size. The main results revealed that (1) predictive cues were distributed across the entire body, but models integrating whole-body information achieved the highest accuracy; (2) informative cues emerged in most body regions around the initiation of the pitcher's weight shift; (3) the accumulation of opponent-specific information had a pronounced effect up to approximately 30 pitches; and (4) substantial individual differences existed in when and which cues were effective for pitch-type prediction. These results clarify the similarities and differences between cues employed by human athletes and those utilized by ML models, thereby providing insights into athlete-specific cognitive strategies. Although alignment with human athletes must be carefully examined in future, a key theoretical contribution of this study is that it explores a complementary approach to conventional hypothesis-testing experiments by offering a time-resolved, data-driven account of where and when pitch-type-predictive information emerges in pitching kinematics.