Exploratory Data Analysis: What Do We Know About YouTube Channels | by Dmitrii Eliuseev

Getting statistical insights using Pandas and the YouTube Data API

Dmitrii Eliuseev
Towards Data Science
Photo by Glenn Carstens-Peters, Unsplash

Nowadays, there are more than 2.7 billion active YouTube users, and for many people, YouTube is not only entertainment but an important source of income. But how does it work? How many views or subscribers can different YouTube channels get? With the help of Python, Pandas, and the YouTube Data API, we can get some interesting insights.


This article will be divided into several parts:

  • Using the YouTube Data API. With this API, we will be able to get a list of YouTube channels for different search requests. For each channel, we will get information about the number of videos, views, and subscribers.
  • Getting the list of channels we are interested in. This can be done only once.
  • Collecting the channel data. To get statistical insights, we need to collect the data for some period of time.
  • Data analysis.

Without further ado, let’s get into it.

1. YouTube Data API

First, a piece of good news for everyone who is interested in collecting data from large networks like YouTube: the YouTube API is free, and we don’t need to pay for it. To start using this API, we need two steps:

  • Open https://console.cloud.google.com and create a new project. I already had an old project there, but after some period of inactivity, all its API limits were set to zeros, and I did not find a way to reset them. So, it’s just easier to make a new one.
Google Cloud Console, Image by author
  • Go to “APIs and Services” and enable “YouTube Data API”. Open the API, go to “Credentials,” and create an API key. If everything is done correctly, the Quotas page will look like this:
YouTube API Quotas, Image by author

Source link

Source link