Predicting QB Success in the NFL

Last year I wrote and submitted a paper to the MIT Sloan Sports Analytics Conference. While my abstract was accepted, my paper was not. The title of my paper was "Reducing Risk in the NFL Draft: Using Machine Learning Algorithms to Predict Success in the NFL." You can read the full paper here.

In it I describe a decision tree model that predicts a college QB's success in the NFL. To train the model I used over 40 variables, including college stats, school competitiveness, combine performance, and text mining of pro scouting reports. Ultimately, the final model used four variables: college win %, body mass index (BMI), college games started per season, and age. The final model was 88% accurate in predicting whether a college player would be a success or a bust in the NFL. This model can be used to predict whether the top prospects in this year's draft will be successful in the NFL.
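To make the shape of such a model concrete, here is a minimal Python sketch of how the four inputs could be computed and passed through a small tree. It is not the model from the paper: the `Prospect` fields, the order of the splits, the direction of each split, and every cut point are placeholders I am assuming for illustration. Only the BMI formula (703 × pounds / inches²) is standard.

```python
from dataclasses import dataclass


@dataclass
class Prospect:
    """College QB inputs. Field names are illustrative, not from the paper."""
    wins: int
    losses: int
    height_in: float
    weight_lb: float
    games_started: int
    seasons: int
    age: float


def win_pct(p: Prospect) -> float:
    """Fraction of college games won."""
    return p.wins / (p.wins + p.losses)


def bmi(p: Prospect) -> float:
    """Standard imperial BMI formula: 703 * pounds / inches squared."""
    return 703.0 * p.weight_lb / p.height_in ** 2


def starts_per_season(p: Prospect) -> float:
    """Average college games started per season played."""
    return p.games_started / p.seasons


# HYPOTHETICAL cut points and split directions, chosen only so the
# sketch runs. They are NOT the thresholds learned by the actual model.
MIN_WIN_PCT = 0.60
MIN_BMI = 27.0
MIN_STARTS_PER_SEASON = 10.0
MAX_AGE = 23.0


def predict(p: Prospect) -> str:
    """Walk a toy tree: fail any check and the prospect is labeled a bust."""
    if win_pct(p) < MIN_WIN_PCT:
        return "bust"
    if starts_per_season(p) < MIN_STARTS_PER_SEASON:
        return "bust"
    if bmi(p) < MIN_BMI:
        return "bust"
    if p.age > MAX_AGE:
        return "bust"
    return "success"


def accuracy(predicted: list[str], actual: list[str]) -> float:
    """Share of prospects whose predicted label matches the actual label."""
    hits = sum(a == b for a, b in zip(predicted, actual))
    return hits / len(actual)
```

For example, `predict(Prospect(30, 10, 76, 230, 36, 3, 22))` passes all four of the placeholder checks, while the same player with only 15 career starts fails the starts-per-season check. The `accuracy` helper is the kind of calculation behind a figure like 88%: correct labels divided by total prospects.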

Below is an interactive version of that final QB model.

10 comments:

  1. Where would Ryan Leaf and Peyton Manning fall in the decision tree? That would be a useful out-of-sample validation.

    Replies
    1. Good idea. I just ran it. Peyton was predicted to be a success: right win %, starts/season, and age. Leaf was actually predicted to be a bust: not enough starts/season. Thanks.

  2. This comment has been removed by the author.

  3. Very cool. I work with researchers at Georgia Tech who did something in a similar vein: http://www.cc.gatech.edu/gvu/ii/sportvis/nfldraft/run/

  4. How is Romo an actual bust? How is Griffin not a bust?

    Replies
    1. Good point.

      Romo was classified as a bust. Bust status is based on approximate value (AV) and starts/season. Romo had high AV but low starts: 9.8 games. He was actually right on the edge.

      Subjectively, it's debatable whether Romo was a success given his injury history. Great player, but he didn't play enough.

      RG3 is wrong. The data is 2 seasons old. I built the model 1.5 years ago. Today RG3 would be classified as a bust.

      See more details in the paper. Thanks.

  5. Adam,

    Was the actual decision tree built in R? Is it possible to post the R code here?

