1 dataset found
  1. o

    National Hockey League Interviews

    • opendatabay.com
    .undefined
    Updated Jun 25, 2025
    Share
    FacebookFacebook
    TwitterTwitter
    Email
    Click to copy link
    Link copied
    Close
    Cite
    Datasimple (2025). National Hockey League Interviews [Dataset]. https://www.opendatabay.com/data/ai-ml/6fa2e3b1-ec4d-4645-9ce2-1788f6417063
    Explore at:
    .undefinedAvailable download formats
    Dataset updated
    Jun 25, 2025
    Dataset authored and provided by
    Datasimple
    License

    CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
    License information was derived automatically

    Area covered
    Sports & Recreation
    Description

    Context This dataset was scraped from http://www.asapsports.com/, using the code in this repository. I designed the webscraping code to account for most of the variance in the website's formatting, but some webpages with formatting that differed significantly were ignored. While manually inspecting random rows of the dataset I did not notice any glaring errors in the transcripts, but I cannot guarantee that there aren't any.

    Content RowId: A unique row identifier

    team1 and team2: The two teams in the Stanley Cup Final. Whether a team is team1 or team2 has no meaning: it's determined by the order of their listing on the website.

    date: The date of the interview

    name: The person being interviewed

    job: Takes values "player", "coach", and "other". If they are a player or coach at the time of the interview they are assigned accordingly. Otherwise they are assigned "other". Most of the people in the "other" category are general managers, league officials, and commentators. Some of these values were assigned automatically based on their title in a transcript (e.g. "Coach Mike Babcock"), and others were assigned manually. A possible source of error is the fact that I did not manually inspect names that appeared only once.

    text: The interview transcript. Interviewer questions were not collected, so all of the speech comes from the interviewee. Responses to questions are separated by periods. These periods will be the only punctuation in the text. Note that a likely source of error in this column is a failure to ignore an interviewer's questions.

    Related Links In other work I used this dataset to train an RNN-based Facebook Messenger chatbot to respond to messages as a hockey player might. More precisely, if you send the bot the beginning of an interview response, it will respond with a 5-sentence continuation of that response. For example, it could receive "Well you know" and respond with "Well you know we played hard out there and…". Follow this link to interact with the chatbot and this link to read the Medium article where I explain how I created the bot.

    Acknowledgements The idea to scrape interviews from ASAPSports came from this article by Mathieu Bray.

    Inspiration How do speech patterns of NHL coaches and players differ? Are coaches more positive than players? More team oriented? How have hockey interview responses changed over the years? Can we create a bot that talks like an NHL player? (See Related Links)

    License

    CC0

    Original Data Source: National Hockey League Interviews

  2. Not seeing a result you expected?
    Learn how you can add new datasets to our index.

Share
FacebookFacebook
TwitterTwitter
Email
Click to copy link
Link copied
Close
Cite
Datasimple (2025). National Hockey League Interviews [Dataset]. https://www.opendatabay.com/data/ai-ml/6fa2e3b1-ec4d-4645-9ce2-1788f6417063

National Hockey League Interviews

Explore at:
.undefinedAvailable download formats
Dataset updated
Jun 25, 2025
Dataset authored and provided by
Datasimple
License

CC0 1.0 Universal Public Domain Dedicationhttps://creativecommons.org/publicdomain/zero/1.0/
License information was derived automatically

Area covered
Sports & Recreation
Description

Context This dataset was scraped from http://www.asapsports.com/, using the code in this repository. I designed the webscraping code to account for most of the variance in the website's formatting, but some webpages with formatting that differed significantly were ignored. While manually inspecting random rows of the dataset I did not notice any glaring errors in the transcripts, but I cannot guarantee that there aren't any.

Content RowId: A unique row identifier

team1 and team2: The two teams in the Stanley Cup Final. Whether a team is team1 or team2 has no meaning: it's determined by the order of their listing on the website.

date: The date of the interview

name: The person being interviewed

job: Takes values "player", "coach", and "other". If they are a player or coach at the time of the interview they are assigned accordingly. Otherwise they are assigned "other". Most of the people in the "other" category are general managers, league officials, and commentators. Some of these values were assigned automatically based on their title in a transcript (e.g. "Coach Mike Babcock"), and others were assigned manually. A possible source of error is the fact that I did not manually inspect names that appeared only once.

text: The interview transcript. Interviewer questions were not collected, so all of the speech comes from the interviewee. Responses to questions are separated by periods. These periods will be the only punctuation in the text. Note that a likely source of error in this column is a failure to ignore an interviewer's questions.

Related Links In other work I used this dataset to train an RNN-based Facebook Messenger chatbot to respond to messages as a hockey player might. More precisely, if you send the bot the beginning of an interview response, it will respond with a 5-sentence continuation of that response. For example, it could receive "Well you know" and respond with "Well you know we played hard out there and…". Follow this link to interact with the chatbot and this link to read the Medium article where I explain how I created the bot.

Acknowledgements The idea to scrape interviews from ASAPSports came from this article by Mathieu Bray.

Inspiration How do speech patterns of NHL coaches and players differ? Are coaches more positive than players? More team oriented? How have hockey interview responses changed over the years? Can we create a bot that talks like an NHL player? (See Related Links)

License

CC0

Original Data Source: National Hockey League Interviews

Search
Clear search
Close search
Google apps
Main menu