Hockey Recruitment with Machine Learning

By Emil Inge Markarlsson on Jan 5, 2025
ML hockey recruitment

Hockey Recruitment with Machine Learning

Traditional scouting relies on subjective evaluations and limited observations. Machine Learning gives us the ability to analyze thousands of players systematically and objectively.

The Problem with Traditional Scouting

Human Limitations:

  • Bias based on previous experience
  • Limited time to observe all players
  • Difficult to remember and compare performances over time
  • Influence of individual games rather than trend analysis

Costs:

  • Expensive travel for scouts
  • Time-intensive process
  • Risk of “group think” within the organization

My ML Approach to Recruitment

1. Player Similarity Models

I use k-nearest neighbors and clustering to find players who resemble already successful profiles:

# Simplified version of similarity scoring
def find_similar_players(target_player, player_database):
    features = ['goals_per_game', 'assists_per_game', 'corsi_for_pct', 
               'zone_starts_off_pct', 'shooting_pct']
    
    scaler = StandardScaler()
    normalized_data = scaler.fit_transform(player_database[features])
    
    similarity_scores = cosine_similarity([target_player], normalized_data)
    return similarity_scores

2. Development Trajectory Prediction

By analyzing development patterns, we can predict which young players will “break out”:

Key indicators:

  • Shot attempt rate progression
  • Deployment evolution (ice time, zone starts)
  • Linemate quality adjustments
  • Age-adjusted performance curves

3. Value vs. Cost Optimization

ML helps us find “bargains” - players who perform better than their salary/draft position indicates.

Case Study: Successful Identification

Last year I helped a team identify a 20-year-old defenseman in a European league:

Scouting report: “Not physical enough for North American hockey”

ML analysis showed:

  • Top 5% in pass completion under pressure
  • Excellent shot suppression metrics
  • Development curve similar to established NHL defensemen
  • Undervalued due to low point production

Result: Player signed for minimal cost and became a top-4 defenseman within 18 months.

Feature Engineering for Recruitment

Traditional stats are just the beginning. Important ML features include:

Contextual Adjustments

  • League quality adjustments
  • Age-relative performance
  • Linemate/teammate quality impact
  • Situational usage (PP, PK, even strength)

Advanced Metrics

  • Expected goal rates (both for and against)
  • Shot quality metrics
  • Puck possession indicators
  • Transition play efficiency

Injury Risk Factors

  • Historical injury patterns
  • Playing style risk indicators
  • Workload sustainability metrics

Implementation for Organizations

Step 1: Data Collection & Cleaning

  • Integrate multiple data sources
  • Standardize metrics across leagues
  • Historical performance database

Step 2: Model Development

  • Train on your organization’s successful picks
  • Validate against missed opportunities
  • Continuous model improvement

Step 3: Scout Integration

  • ML as complement, not replacement
  • Prioritization of scouting resources
  • Red flags for human review

ROI & Results

Organizations implementing ML-driven scouting see:

  • 25-40% improvement in draft success rate
  • Reduced scouting costs through smart resource allocation
  • Faster identification of emerging talents
  • Less bias in evaluation process

Next Steps

Ready to revolutionize your recruitment process? I help organizations implement customized ML solutions for talent identification.

Book Strategy Meeting →